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Preface 


Fermat’s problem, also called Fermat’s last theorem, has attracted the 
attention of mathematicians for more than three centuries. Many clever 
methods have been devised to attack the problem, and many beautiful 
theories have been created with the aim of proving the theorem. Yet, despite 
all the attempts, the question remains unanswered. 

The topic is presented in the form of lectures, where I survey the main 
lines of work on the problem. In the first two lectures, there is a very brief 
description of the early history, as well as a selection of a few of the more 
representative recent results. In the lectures which follow, I examine in suc- 
cession the main theories connected with the problem. The last two lectures 
are about analogues to Fermat’s theorem. 

Some of these lectures were actually given, in a shorter version, at the 
Institut Henri Poincaré, in Paris, as well as at Queen’s University, in 1977. 

I endeavoured to produce a text, readable by mathematicians in general, 
and not only by specialists in number theory. However, due to a limitation 
in size, I am aware that certain points will appear sketchy. 

Another book on Fermat’s theorem, now in preparation, will contain a 
considerable amount of the technical developments omitted here. It will 
serve those who wish to learn these matters in depth and, I hope, it will 
clarify and complement the present volume. 

It is for me gratifying to acknowledge the help and encouragement I 
received in the preparation ofthis book: A. J. Coleman and the Mathematics 
Department at Queen’s University—for providing excellent working con- 
ditions; E. M. Wight--for her dilligent and skillful typing of the manuscript; 
G. Cornell—who read the book and helped very much in improving the 
style; The Canada Council—for partial support; C. Pisot and J. Oesterle— 
who arranged for my lectures at the Institut Henri Poincaré. 


Vili Preface 


It is also my pleasure to report here various suggestions, criticisms and 
comments from several specialists, whom I consulted on specific points or 
to whom I have sent an earlier typescript version of this book. In alphabetical 
order: A. Baker, D. Bertrand, K. Inkeri, G. Kreisel, H. W. Lenstra Jr., J. M. 
Masley, M. Mendés-France, B. Mazur, T. Metsänkylä, A. Odlyzko, K. 
Ribet, A. Robert, P. Samuel, A. Schinzel, E. Snapper, C. L. Stewant, 
G. Terjanian, A. J. van der Poorten, S. S. Wagstaff, M. Waldschmidt, 
L. C. Washington. 


Kingston, March, 1979 Paulo Ribenboim 
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LECTURE I 


The Early History of 
Fermat’s Last Theorem 


1. The Problem 


Pierre de Fermat (1601-1665) was a French judge who lived in Toulouse. 
He was a universal spirit, cultivating poetry, Greek philology, law but mainly 
mathematics. His special interest concerned the solutions of equations in 
integers. 

For example, Fermat studied equations of the type 


X? — dY? = +1, 


where d is a positive square-free integer (that is, without square factors 
different from 1) and he discovered the existence of infinitely many solutions. 
He has also discovered which natural numbers n may be written as the sum 
of two squares, namely those with the following property: every prime factor 
p of n which is congruent to 3 modulo 4 must divide n to an even power. 

In the margin of his copy of Bachet’s edition of the complete works of 
Diophantus, Fermat wrote: 


It is impossible to separate a cube into two cubes, or a biquadrate into two 
biquadrates, or in general any power higher than the second into powers of 
like degree; I have discovered a truly remarkable proof which this margin is 
too small to contain. 


This copy is now lost, but the remark appears in the 1670 edition of the 
works of Fermat, edited in Toulouse by his son Samuel de Fermat. It is 
stated in Dickson’s History of the Theory of Numbers, volume II, that 
Fermat’s assertion was made about 1637. Tannery (1883) mentions a letter 
from Fermat to Mersenne (for Sainte-Croix) in which he wishes to find two 
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cubes whose sum is a cube, and two biquadrates whose sum is a biquadrate. 
This letter appears, with the date June 1638, in volume 7 of Correspondance 
du Pere Marin Mersenne (1962); see also Itard (1948). The same problem was 
proposed to Frénicle de Bessy (1640) in a letter to Mersenne, and to Wallis 
and Brouncker in a letter to Digby, written in 1657, but there is no mention 
of the remarkable proof he had supposedly found. 

In modern language, Fermat’s statement means: 


The equation X" + Y" = Z", where n is a natural number larger than 2, 
has no solution in integers all different from 0. 


No proof of this statement was ever found among Fermat’s papers. He 
did, however, write a proof that the equations X* — Y* = Z? and X4 + Y* = 
Z* have no solutions in integers all different from 0. In fact, this is one 
of two proofs by Fermat in number theory which have been preserved’, 
With very few exceptions, all Fermat’s other assertions have now been 
confirmed. So this problem is usually called Fermat’s last theorem, despite 
the fact that it has never been proved. 

Fermat’s most notable erroneous belief concerns the numbers P, = 
2?" + 1, which he thought were always prime. But Euler showed that F5 
is not a prime. Sierpifiski and Schinzel pointed out some other false assertions 
made by Fermat. 

Mathematicians have debated whether Fermat indeed possessed the proof 
of the theorem. Perhaps, at one point, he mistakenly believed he had found 
such a proof. Despite Fermat’s honesty and frankness in acknowledging 
imperfect conclusions, it is very difficult to understand today, how the most 
distinguished mathematicians could have failed to rediscover a proof, if one 
had existed. 

To illustrate Fermat’s candor, we quote from his letter of October 18, 
1640 to Frénicle de Bessy: 


Mais je vous advoue tout net (car par advance je vous advertis que comme 
je suis pas capable de m’attribuer plus que je nesgay, je dis avec même franchise 
ce que je ne sgay pas) que je nay peu encore démonstrer I’exclusion de tous 
diviseurs en cette belle proposition que je vous avois envoyée, et que vous 
m’avez confirmée touchant les nombres 3, 5, 17, 257, 65537 & c. Car bien que 
je réduise l’exclusion a la plupart des nombres, et que jaye même des raisons 
probables pur le reste, je nay peu encore démonstrer nécessairement la 
vérité de cette proposition, de laquelle pourtant je ne doute non plus a cette 
heure que je faisois auparavant. Si vous en avez la preuve assurée, vous 
m’obligerez de me la communiquer: car apres cela rien ne m’arrestera en ces 
matières. 


1 The other proof, partial but very interesting, was brought to light and reproduced by Hofmann 
(1943, pages 41-44). Fermat showed that the only solutions in integers of the system x = 2y? -1, 
x? = 2z? — l are x = l and x = 7. 
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Again, in a letter to Pascal from August 29, 1654, Fermat proposes the 
same problem: 


Au reste, il west rien à lavenir que je ne vous communique avec toute 
franchise. Songez cependant, si vous le trouvez à propos, à cette proposition: 
les puissances carrées de 2, augmentées de l'unité, sont toujours des nombres 
premiers: 2? + 1 = 5, 27 +1 = 17, 2? +1 = 257, 2? + 1 = 65537, sont 
premiers, et ainsi à l'infini. C’est une proposition de la verité de laquelle je 
vous répond. La démonstration en est trés malaisée, et je vous avoue que je 
wai pu encore la trouver pleinement; je ne vous la proposerois pas pour la 
chercher si j’en étois venu a bout. 


Incidentally Pascal has written to Fermat stating: 


Je vous tiens pour le plus grand géomètre de toute l’Europe. 


It is also highly improbable that Fermat would have claimed to have 
proved his last theorem, just because he succeeded in proving it for a few 
small exponents. 

In contrast, Gauss believed that Fermat’s assertions were mostly extra- 
polations from particular cases. In 1807, Gauss wrote: “Higher arithmetic 
has this special feature that many of its most beautiful theorems may be 
easily discovered by induction, while any proof can be only obtained with 
the utmost difficulty. Thus, it was one of the great merits of Euler to have 
proved several of Fermat’s theorems which he obtained, it appears, by 
induction”. 

Even though he himself gave a proof for the case of cubes, Gauss did not 
hold the problem in such high esteem. On March 21, 1816, he wrote to 
Olbers about the recent mathematical contest of the Paris Academy on 
Fermat’s last theorem: 


I am very much obliged for your news concerning the Paris prize. But I 
confess that Fermat’s theorem as an isolated proposition has very little 
interest for me, because I could easily lay down a multitude of such proposi- 
tions, which one could neither prove nor dispose of. 


In trying to prove Fermat’s theorem for every positive integer n > 3, I 
make the following easy observation. If the theorem holds for an integer m 
and n = Im is a multiple of m, then it holds also for n. For, if x, y, z are non- 
zero integers and x” + y" = z" then (x')" + (y')" = (z'", contradicting the 
hypothesis. Since every integer n > 3 is a multiple of 4 or of a prime p ¥ 2, 
it suffices to prove Fermat’s conjecture for n = 4 and for every prime p # 2. 
However, I shall occassionally also mention some proofs for exponents 
of the form 2p, or p" where p is an odd prime. 

The statement of Fermat’s last theorem is often subdivided further into 
two cases: 

The first case holds for the exponent p when there do not exist integers 
x, y, z such that py xyz and x? + y? = z’. 




























































































A I The Early History of Fermat’s Last Theorem 


The second case holds for the exponent p when there do not exist integers 
x, y, z, all different from 0, such that p|xyz, gced(x,y,z) = 1 and x? + y? = 2’. 


2. Early Attempts 


It was already known in antiquity that a sum of two squares of integers may 
well be the square of another integer. Pythagoras was supposed to have 
proven that the lengths a, b, c of the sides of a right-angle triangle satisfy the 
relation 

a? +b? =c?; 


so the above fact just means the existence of such triangles with sides mea- 
sured by integers. 

But the situation is already very different for cubes, biquadrates and so on. 
Fermat’s proof for the case of biquadrates is very ingenious and proceeds 
by the method which he called infinite descent. Roughly, it goes as follows: 
Suppose a certain equation f(X,Y,Z) = 0 has integral solutions a, b, c, with 
c > 0, the method just consists in finding another solution in integers a’, b’, c 
with 0 < ¢' < c. Repeating this procedure a number of times, one would 
reach a solution a”, b”, c”, with 0 < c” < 1, which is absurd. This method of 
infinite descent is nothing but the well-ordering principle of the natural 
numbers. 

Little by little Fermat’s problem aroused the interest of mathematicians 
and a dazzling array of the best minds turned to it. 

Euler considered the case of cubes. Without loss of generality, one may 
assume x? + y? = z? where x, y, z are pairwise relatively prime integers, 
x, y are odd, so x =a—b, y=a+b. Then x + y= 2a, x? = xy +y = 
a? + 3b? and z? = x? + y? = 2a(a? + 3b”), where the integers 2a, a? + 3b? 
are either relatively prime or have their greatest common divisor equal to 3. 
Euler was led to studying odd cubes a? + 3b? (with a, b relatively prime), 
and forms of their divisors; he concluded the proof by the method of infinite 
descent. The properties of the numbers a? + 3b? which were required had 
to be derived from a detailed study of divisibility, and therefore were omitted 
from the proof published in Euler’s book on algebra (1822). This proof, with 
the same gap, was reproduced by Legendre. Later, mathematicians intrigued 
by the missing steps were able without much difficulty, to reconstruct the 
proof on a sound basis. In today’s language, numbers of the form a? + 3b? 
are norms of algebraic integers of the quadratic extension Q(/—3) of the 
rational field Q and the required properties can be deduced from the unique 
factorization theorem, which is valid in that field. 

Gauss gave another proof for the case of cubes. His proof was not 
“rational” since it involved complex numbers, namely those generated by 
the cube root of unity ¢ = (— 1 + /—3)/2, i.e., numbers from the quadratic 
field @(,/— 3). He consciously used the arithmetic properties of this field. The 
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l underlying idea was to call “integers” all numbers of the form (a + b,/—3)/2 
where a, b are integers of the same parity; then to define divisibility and the 
prime integers, and to use the fact that every integer is, in a unique way, the 
product of powers of primes. Of course some new facts appeared First 
the integers +¢, +£? that divide 1 are “units” since te? = 1 and therefore 
should not be taken into account so to speak, in questions of divisibilit 
Thus, all the properties have to be stated “up to units”. Secondly, the uni e 
factorization, which was taken for granted, was by no means eet ce 
fact it turned out to be false in general. I shall return to this later. 

Gauss’s proof was an early incursion into the realm of number fields. i.e 
those sets of complex numbers obtained from the roots of polynomials by 
the operations of addition, subtraction, multiplication, and division d 

l In the 1820s a number of distinguished French and German St 
ticlans were trying intensively to prove Fermat’s theorem. 

In 1825, G. Lejeune Dirichlet read at the Académie des Sciences de Paris 
a paper where he attempted to prove the theorem for the exponent 5. In fact 
his proof was incomplete, as pointed out by Legendre, who provided an 
independent and complete proof. Dirichlet then completed his own proof, 
which was published in Crelle Journal, in 1828. Be 

Dirichlet’s proof is “rational”, and involves numbers of the form a? — 5b? 
He carefully analyzed the nature of such numbers which are Sth powers 
when either a, b are odd, or a, b have different parity, and 5 does not divide a 
5 divides b, and a, b are relatively prime. Nowadays the properties he derived 


` can be obtained from the arithmetic of the field Q(,/5). In this field too 


a integer has a unique factorization. Moreover every unit is a power of 
Ge + SEN is of crucial importance in the proof. Of course, for 
irichlet this knowledge took the form of numeri i i hi 

cal manipul 
lead to the same result. Be 


In 1832 Dirichlet settled the theorem for the exponent 14. 
N The next important advance was due to Lamé, who, in 1839 proved the 
eorem for a = 7. Soon after, Lebesgue simplified Lamé’s proof consider- 


ably by a clever use of the identity, 
(X+Y+Z)’-(X74Y74+Z’) 
=X + Y)(X + ZXY +Z) 
x [(X? + Y? + Z? + XY + XZ + YZ? + XYZX +Y + Z)] 
already considered by Lamé. 


T See special cases of small exponents were being studied, a 

remarkable theorem was proved by Sophi i , 

Wer y Sophie Germain, a French 
Previously Barlow, and then Abel, had indicated interesting relations that 


x, y, z must satisfy if x? + y? = z? (and x 
5 l , y, z are not zero). Through clever 
manipulations, Sophie Germain proved: i 2 


If p is an odd prime such that 2 i 
p + 1 is also a prime then th 
Fermat's theorem holds for p. P D 
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These results were communicated by letter to Legendre and Cauchy since 
the regulations of the Academy prevented women from presenting the dis- 
coveries in person. 

There are many primes p for which 2p + 1 is also prime, but it is still not 
known whether there are infinitely many such primes. 

Following Sophie Germain’s ideas, Legendre proved the following theo- 
rem: Let p, q be distinct odd primes, and assume the following two conditions: 


1. p is never congruent modulo q to a pth power. 
2. the congruence X? + Y? + Z?=0 (mod q) has no solution x, y, z, unless 


q divides xyz. 


Then the first case of Fermat’s theorem holds for p. With this result, 
Legendre extended Sophie Germain’s theorem as follows: 


If pis a prime such that 4p + 1, 8p + 1, 10p + 1,14p + 1, or 16p + 1 is also 
a prime then the first case of Fermat's theorem holds for the exponent p. 


This was sufficient to establish the first case for all prime exponents 
p < 100. 


3. Kummer’s Monumental Theorem 


By 1840, Cauchy and Lamé were working with values of polynomials at 
roots of unity, trying to prove Fermat’s theorem for arbitrary exponents. 
Already in 1840 Cauchy published a long memoir on the theory of numbers, 
which however was not directly connected with Fermat’s problem. In 1847, 
Lamé presented to the Academy a “proof” of the theorem and his paper was 
printed in full in Liouville’s journal. However, Liouville noticed that the 
proof was not valid, since Lamé had tacitly assumed that the decomposition 
of certain polynomial expressions in the nth root of unity into irreducible 
factors was unique. 

Lamé attributed his use of complex numbers to a suggestion from Liouville, 
while Cauchy claimed that he was about to achieve the same results, given 
more time. Indeed, during that same year, Cauchy had 18 communications 
printed by the Academy on complex numbers, or more specifically, on 
radical polynomials. He tried to prove what amounted to the euclidean 
algorithm, and hence unique factorization for cyclotomic integers. Then, 
assuming unique factorization, he drew wrong conclusions. Eventually 
Cauchy recognized his mistake. In fact, his approach led to results which 
were later rediscovered by Kummer with more suitable terminology. A 
noteworthy proposition of Cauchy was the following one, (C. R. Acad. Sci. 
Paris, 25, 1847, page 181) later also found by Genocchi and by Kummer: 
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If the first case of Fermat’s theorem fails for the exponent p, then the sum 


Ki -4 
1P74 4 2Pt (25- “y 


is a multiple of p. 
By the year 1847, mathematicians were aware of both the subtlety and 


importance of the unique decomposition of cyclotomic integers into ir- 
reducible factors. 

In Germany, Kummer devoted himself to the study of the arithmetic of 
cyclotomic fields. Already, in 1844, he recognized that the unique factoriza- 
tion theorem need not hold for the cyclotomic field Q(¢,). The first such case 
occurs for p = 23. However, while trying to rescue the unique factorization 
he was led to the introduction of new “ideal numbers”. Here is an excerpt 
of a letter from Kummer to Liouville (1847): S 


... Encouraged by my friend Mr. Lejeune Dirichlet, I take the liberty of 
sending youa few copies of a dissertation which I have written three years ago, 
at the occasion of the century jubileum of the University of Königsber, as 
well as of another dissertation of my friend and student Mr. Ee a 
young and distinguished geometer. In these memoirs, which I beg you ‘to 
accept as a sign of my deep esteem, you will find developments concernin 
certain points in the theory of complex numbers composed of roots of unit S 
i.e., roots of the equation r” = 1, which have been recently the subject of er 
discussions at your illustrious Academy, at the occasion of an attempt b 
Mr. Lamé to prove the last theorem of Fermat. SE 

Concerning the elementary proposition for these complex numbers, that 
a composite complex number may be decomposed into prime factors in only one 
way, which you regret so justly in this proof, which is also lacking in some 


> y 
SC points I may assure you that Di does not hold in general for complex 


de + air + ar? +--+ H apar", 


but it is possible to rescue it, by introducing a new kind of complex numbers 
which I have called ideal complex number. The results of my research on this 
See nn been communicated to the Academy of Berlin and printed in 
a a (March 1846); a memoir on the same subject will appear 
en relle A ournal. I have considered already long ago the applications 
ey ory to the proof of Fermat’s theorem and I succeeded in deriving 

€ impossibility of the equation x" + y" = z" from two properties of the 
Prime number n, so that it remains only to find out whether these properties 
are shared by all prime numbers. In case these results seem worth some of 


your attention, you may find them published i i i 
Baan Aa published in the Sitzungsberichte of the 


The theorem which K i i 
ummer mentioned in this letter repres 
Ee over all his predecessors. SE 
Es he ideal numbers correspond to today’s divisors. Dedekind rephrased 
concept, introducing the ideals, which are sets I of algebraic integers of 
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the cyclotomic field such that 0 e I; if, Be I then « + B,a— Pel; ifael 
and ß is any cyclotomic integer then af e I. Ideals may be multiplied in a 
very natural way. 

Each cyclotomic integer « determines a principal ideal consisting of all 
elements Ba, where f e A, the set of cyclotomic integers. 

If all ideals are principal there is unique factorization in the cyclotomic 
field, and conversely. For the cases when not all ideals are principal, Kummer 
wanted to “measure” to what extent some of the ideals were not principal. 
So he considered two nonzero ideals I, I' equivalent when J’ consists of all 
multiples of the elements of I by some nonzero element « in the cyclotomic 
field. Thus, there is exactly one equivalence class when all ideals are principal. 
Kummer proved that there are only finitely many equivalence classes of 
ideals in each cyclotomic field OC A 

Let h, denote the number of such classes. If p does not divide h, then p is 
said to be a regular prime. In this case, if the ideal J? is a principal ideal then 
I is itself a principal ideal. But the main property used by Kummer is the 
following lemma: 


If p is a regular prime, p # 2, if wis a unit in the ring A of cyclotomic integers 
of Q(6,), and if there exists an ordinary integer m such that o — me AQ — OP", 
then o is the pth power of another unit. 


The proof of this lemma requires deep analytical methods. 

Armed with this formidable weapon, Kummer proved that Fermat’s last 
theorem holds for every exponent p which is a regular prime. This is the 
theorem which Kummer mentioned in his letter to Liouville. At first Kummer 
believed that there exist infinitely many regular primes. But, he later realized 
that this is far from evident—and in fact, it has, as yet, not been proved. 

A well-known story concerning a wrong proof of Fermat’s theorem 
submitted by Kummer, originates with Hensel. Specifically, in his address 
to commemorate the first centennial of Kummer’s birth, Hensel (1910) stated: 


Although it is not well known, Kummer at one time believed he had found 
a complete proof of Fermat’s theorem. (This is attested to by reliable witnesses 
including Mr. Gundelfinger who heard the story from the mathematician 
Grassmann.) Seeking the best critic for his proof, Kummer sent his manuscript 
to Dirichlet, author of the insuperably beautiful proof for the case 4 = 5. 
After a few days, Dirichlet replied with the opinion that the proof was excellent 
and certainly correct, provided the numbers in « could not only be decomposed 
into indecomposable factors, as Kummer proved, but that this could be done 
in only one way. If however, the second hypothesis couldn't be satisfied, most 
of the theorems for the arithmetic of numbers in « would be unproven and 
the proof of Kummer’s theorem would fall apart. Unfortunately, it appeared 
to him that the numbers in « didn’t actually possess this property in general. 


This is confirmed in a letter, which is not dated (but likely from the summer 
of 1844), written by Eisenstein to Stern, a mathematician from Gottingen. 
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In a recent paper, Edwards (1975) analyzes this information, in the light 
of a letter from Liouville to Dirichlet and expresses doubts about the exis- 
tence of such a “false proof” by Kummer. 


4. Regular Primes 


To decide whether a prime is regular it is necessary to compute the number 
of equivalence classes of ideals of the cyclotomic field. Kummer succeeded 
in deriving formulas for the class number h, which were good enough to 
allow an explicit computation for fairly high exponents p. In this way, he 
discovered that 37, 59, 67 were irregular primes—actually these are the only 
ones less than 100. 

One of the most interesting features in this study was the appearance of 
the Bernoulli numbers. In the derivation of the class number formula, there 
was an expression of the type 


1k 4 2k +--+ 4 (n— 1 


which had to be computed for large values of k and n. First it is easy to show 
that there is a unique polynomial S,(X) with rational coefficients of degree 
k + 1, having leading coefficient 1/(k + 1) and such that for every n > 1 its 
value is S,(n) = 1* + 2*4---+n* These polynomials can be determined 
recursively and may be written as follows: 


k+1 k 
(e+ nsn = xe —( Joar "uer (ar 


Të coefficients B,, B,,..., B, had already been discovered by Bernoulli. 
n fact Euler had already studied these numbers and found that they can be 
generated by considering the formal inverse of the series 











EEE Poa La 
EE Tae E 
namely 
X P, B, B 
= 2 3 
ve L+ ta haa eia 


This series appears i ansi i 

cot x = e Se d eer ae expansion of the cotangent function: 

SE GG easily seen that B, = 0 for every odd k, k #1. The first Bernoulli 

a B, = ~1/2, B, = 1/6, By = —1/30, Bs = 1/42, B = — 1/30, 

Bio = /66, Bi, = —691/2730, Big = 7/6, Bis = —3617/510, Bis = 
867/798. The numerators grow quickly, for example: 





2577687858367 
B34 = --— 6 : 
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Bernoulli numbers have fascinating arithmetical properties, but I have 
to refrain from describing them. I will just mention their relation with 
Riemann’s zeta-function {(s) = Y7- ı (1/n*) (for s > 1). The following formula 
holds: 
2(2k)! 
(2n)?* 





By =(-1)"' CO (for k > 1). 

Through his studies of the class number formula, Kummer showed that 
a prime number p is regular if and only if p does not divide the numerators 
of the Bernoulli numbers B,, By,..., B,-3- 

From the data he acquired, it was reasonable to conjecture that there are 
infinitely many regular primes, at least they seemed to appear more frequently 
than the irregular primes. Yet, this has never been proved and appears to be 
extremely difficult. Paradoxically, Jensen proved in 1915, in a rather simple 
way that there are in fact infinitely many irregular primes. 

This was the situation around 1850. The theorem was proved for regular 
primes, the Bernoulli numbers had entered the stage and the main question 
was how to proceed in the case of irregular primes. 


5. Kummer’s Work on Irregular Prime Exponents 


In 1851 Kummer began examining the irregular prime exponents. Aiming 
to derive congruences which must be satisfied if the first case fails, he produced 
some of his deepest results on cyclotomic fields. 

It is impossible to describe in a short space Kummere highly technical 
considerations, but the main points, which we mention here, give at least 
some idea of his astonishing mastery. First, he carefully studied the periods 
of the cyclotomic polynomial 


D(X) SP +X? a ed, 


Suppose q is a prime number, q # p, f is the order of q modulo p, p — 1 = fr, 
and let g bea primitive root modulo p, and¢a primitive pth root of 1. Kummer 
considered the r periods of f terms each no, 41,-- -> Nr- 1 (already defined and 
used by Gauss). For example yo = ¢ + [9 + [#" ++ (97°, the other pe- 
riods being conjugate to yo. If A is the ring of cyclotomic integers, and A’ is the 
ring of integers of the field K’ = Q(yo) = = Q(nr-1); Kummer showed 
that A is a free module over A’, with basis {1,C,...,f/~*}, and A’= 
Z{no.----sMy-1] is a free abelian group with basis (n0,1M1,- - -Mr—1}- He also 
studied the decomposition of the prime q in the ring A’. 

Then, Kummer gave his beautiful proof that the group of classes of ideals 
of the cyclotomic field is generated by the classes of the prime ideals with 
prime norm. 
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Another ingredient in his work was the use of the cyclotomic functions 
first introduced by Jacobi. If a is an odd prime of the form q = kp + 1, if h 
is a primitive root mod a, ¢ a primitive pth root of 1 and y a primitive qth 
root of 1, let 


q-1 
<p Sy Cy 
t=1 


where ind,(z), the index of t (with respect to h, q) is the only integer s, 1 < s < 
q — 1 such that t = h'(mod q). it 
For every integer de Z, let 
q-2 
Yalt) Ber > Cindn(t) "Mr 1) indy (t+ 1) 
t=1 
If Q is the ideal of A generated by q and h* — ¢ (where q = kp + 1) then of 
course Q is a prime ideal of norm q, that is, Ag = [ ]P-2 o(Q) (where o is a 
generator of the Galois group). The main results concern certain products 
of conjugates of Q which are principal ideals: 
p-2 
ALE = [] A, 
i=0 


with ge = g’(mod p), n = (p — 1)/2 and if 
I,= {i)0<i<p—2,g,-;+ In-itindgid) > D 


Apak) = TI FO). 


iela 


then 


All this was put together to give Kummer his congruences. If x, y, z are 


pei relatively prime integers, not multiples of p, such that x? + y? + z? = 
, then 


(Az)? = A(x? + yP) = Alx + y) T] Ais + Det 
k=0 


where g is a primitive root modulo i 
p. The ideals A(x + y), A(x + C*y) are 

pth powers of ideals, say A(x + y) = JB, A(x + (“y) = J? (Ją being a con- 
m of J o). For every d,1 < d < p — 2, and I, defined as before, The 1,7 (J) 
Sa principal ideal, say Aw, where a = F(£), F(X) being a polynomial with 
Coefficients in Z and degree at most p — 2. Then 
II (x + Xy) = X"(F(X))? + &,(X)M(X), 
iela 
where M(X) e Z[X]. 
T Eng these polynomials as functions of the real variable t > 0, 

ing t = e” and taking an appropriate branch of the logarithm we obtain: 


X log(x + ey) = mv + plog Fle’ | TAE IME 
iela ELISE EEE ert Fier y 
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Let D"G denote the nth derivative of G(v), at v = 0. Kummer showed for 
2s = 2,4,...,p—3 (p # 2,3) that the following congruences are satisfied: 


[D?-**log(x + e’y)]B2s = 0 (mod p), 


where B,, is the Bernoulli number of index 2s. 

Since D/log(x + e’y) = R,(x,y)/(x + yy, where R,(X, Y) is a homogeneous 
polynomial of total degree j, multiple of Y, writing R,(X,Y) = X/P,(T), it 
follows that 

P,-2sOB:; = 0 (mod p) 


for 2s = 2,4,...,p—3. 
The polynomials P,(T) may be computed recursively. With these con- 
gruences, Kummer improved his previous result: 


If p divides the numerator of at most one of the Bernoulli numbers 
By, By,..., B,- 3, then the first case of Fermat’s theorem holds for p. 


In 1905 Mirimanoff generalized this last result of Kummer, as follows: 


If p does not divide the numerator of one of the four Bernoulli numbers 
B,-3, Bp- 55 Bp—7, Ba, then the first case holds for the prime p. 


This theorem is again a tour de force. However, due to the long com- 
putations involving large Bernoulli numbers, its applicability is limited. 

It was becoming increasingly clear that new and significantly more 
powerful methods were necessary to provide any substantial progress. 

Later, I shall describe the sensational work by Wieferich and Mirimanoff 
early this century, and how Furtwängler used class field theory (more 
specifically Eisenstein’s reciprocity law for the power residue symbol) to 
improve and simplify these results. All this brought into the battle the newly 
created forces of class field theory. 


6. Other Relevant Results 


In 1856, Grünert considered the size of possible solutions of Fermat’s 
equation. 

He proved that if x, y, z are nonzero integers such that x” + y" = 2”, with 
0<x<y<z, then necessarily x > n. This was very easy to prove. 

For example, if p = 101 the smallest nontrivial solution, if it exists, would 
involve numbers greater than 102!°'. This pointed to a fact which was 
becoming more and more apparent: In order to disprove Fermat’s statement 
one has to deal with very large numbers. 

In 1894, following the line of Sophie Germain, Wendt contributed an 
interesting theorem. He considered the determinant W, of the circulant 
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matrix 


O ÀQ- 


which is equal to | [7=$[(1 + é)" — 1], where &, = 1 
nth roots of 1. g l ‘e SE 
Wendt proved: 


If p is an odd prime, if there exists h > 1 such that q = 2hp + 1 is prime, if 


q does not divide W,, and p™ #1 (mod al, then the first ? 
conjecture holds for p. 2 EE 


A first step in the proof is the following: i i ivisi 
g: if x, y, z are integers not divisible 
SE ER + y? + z? = 0 (mod q) then q divides W,,,. 
is leads to the interesting and related problem: i i 
does the congruence g EE 


X? + YP + Z’ = 0 (mod al 


ar a solution in integers x, y, z not multiples of q? Of course this depends 
D q. 

A Se GEN P, ay exist infinitely many primes q such that the above con- 

ence doe. i indi ; 
re s not have a solution as indicated, then Fermat’s theorem would 
i an in 1909, een showed that this hypothesis is false. More precisely, 
S l Fi (p — Dip — 2)? + 6p — 2 then the above congruence modulo q has 
a olution. In the same year, Hurwitz generalized this theorem, in a very 
autiful paper, by counting the number of solutions of 


a, X + aX} +--+ + aX? = 0 (mod q). 


All these considerations led again to deep investigations of the number of 


zeros of polynomials over finite fields, event inki i 
; ually 1 i 
hypothesis for function fields. RE 
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Cauchy wrote the following report: 


Eleven memoirs have been presented to the Secretary. But none has solved 
the proposed question. The Commissaries have nevertheless noted that the 
piece registered under number 2 contained a new solution of the problem in 
the special case developed by Fermat himself, namely when the exponent is 
equal to 4. 

Thus, after being many times put for a prize, the question remains at the 
point where M. Kummer left it. However, the mathematical sciences should 
congratulate themselves for the works which were undertaken by the ge- 
ometers, with their desire to solve the question, specially by M. Kummer; and 
the Commissaries think that the Academy would make an honourable and 
useful decision if, by withdrawing the question from the competition, it 
would adjugate the medal to M. Kummer, for his beautiful researches on the 
complex numbers composed of roots of unity and integers. 


In 1908 the very substantial Wolfskehl Prize, in the amount of 100,000 
Mark, was offered with the same aim by the Königliche Gesellschaft der 


Wissenschaften, in Göttingen, Germany: 


By the power conferred on us, by Dr. Paul Wolfskehl, deceased in 
Darmstadt, hereby we fund a prize of one hundred thousand Marks, to be 
given to the person who will be the first to prove the great theorem of Fermat. 

In his will, Doctor Wolfskehl observed that Fermat (Oeuvres, Paris, 1891, 
volume I, p. 291, observation 2) asserted mutatis mutandis that the equation 
x? + y? = z has no integral solutions for any odd prime number 4. This 
theorem has to be proved, either following the ideas of Fermat, or completing 
the researches of Kummer (Crelle’s Journal, vol. XL, page 130; Abhandlungen 
der Akademie der Wissenschaften zu Berlin, 1857), for all exponents 4, for 
which it has some meaning [consult Hilbert, Theorie der Algebraischen 
Zahlkörper, 1894-1895, and Enzyklopädie der Mathematischen Wissenschaften, 
(1900-1904), IC 4b, page 713]. 

The following rules will be followed: 

_ The Königliche Gesellschaft der Wissenschaften in Göttingen will decide 
in entire freedom to whom the prize should be conferred. It will refuse to 
accept any manuscript written with the aim of entering the competition to 
obtain the Prize. It will only take in consideration those mathematical memoirs 
which have appeared in the form ofa monograph in the periodicals, or which 
are for sale in the bookstores. The Society asks the authors of such memoirs 
to send at least five printed exemplars. 

Works which are published in a language which is not understood by the 
scholarly specialists chosen for the jury will be excluded from the competition. 
The authors of such works will be allowed to replace them by translations, of 
guaranteed faithfulness. 

The Society declines its responsibility for the examination of works not 
brought to its attention, as well as for the errors which might result from the 
fact that the author of a work, or part of a work, are unknown to the Society. 

The Society keeps the right of decision in the case where various persons 
would have dealt with the solution of the problem, or for the case where the 
solution is the result of the combined efforts of several scholars, in particular 
in what concerns the partition of the Prize, at its own discretion. 

The award of the Prize by the Society will take place not earlier than two 
years after the publication of the memoir to be crowned. The interval of time 
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is aimed to allow the German and foreign mathematicians to voice their 
opinion about the validity of the solution published. 

__As soon as the Prize will be conferred by the Society, the laureate will be 
informed by the secretary, on the name of the Society, and the result will be 
published everywhere the Prize would have been announced during the 
preceding year. The assignment of the Prize by the Society is not to be the 
subject of any further discussion. 

The payment of the Prize will be made to the laureate, in the next three 
months after the award, by the Royal Cashier of Göttingen University, or 
at the receivers own risk, at any other place he will have designated. 
The capital may be delivered against receipt, at the Society’s will, either 
in cash, or by the transfer of financial values. The payment of the Prize will be 
considered as accomplished by the transmission of these financial values, even 
though their total value at the day’s course would not attain 100,000 Mark. 

If the Prize is not awarded by September 13, 2007, no ulterior claim will 
be accepted. 

The competition for the Prize Wolfskehl is open, as of today, under the 
above conditions. 


Göttingen, June 27, 1908 
Die Königliche Gesellschaft der Wissenschaften. 


A memorandum dated 1958 states that the Prize of 100,000 DM has been 
reduced to approximately 7,600 DM, in virtue of the inflation and financial 
changes. 


Dr. F. Schlichting, from the Mathematics Institute of the University of 


Göttingen, was kind enough to provide me with the following information 
on the Wolfskehl Prize: 


Göttingen, March 23, 1974. 


Dear Sir: 
‚Please excuse the delay in answering your letter. I enclose a copy of the 
original announcement, which gives the main regulations, and a note of the 
Akademie” which is usually sent to persons who are applying for the prize. 
now worth a little bit more than 10,000 DM. There is no count of the total 
number of “solutions” submitted so far. In the first year (1907-1908) 621 
solutions were registered in the files of the Akademie, and today they have 
stored about 3 meters of correspondence concerning the Fermat problem. 
In recent decades it was handled in the following way: the secretary of the 
Akademie divides the arriving manuscripts into (1) complete nonsense, which 
is sent back immediately, and into (2) material which looks like mathematics. 
The second part is given to the mathematical department and there, the work 
of reading, finding mistakes and answering is delegated to one of the scientific 
assistants (at German universities these are graduated individuals working 
for Ph.D. or habilitation and helping the professors with teaching and 
Supervision)—at the moment I am the victim. There are about 3 to 4 letters 
to answer per month, and there is a lot of funny and curious material arriving 
e.g., like the one sending the first half of his solution and promising the second 
if we would pay 1000 DM in advance; or another one, who promised me 10 
per cent of his profits from publications, radio and TV interviews after he 
got famous, if only I would support him now; if not, he threatened to send 
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it to a Russian mathematics department to deprive us of the glory of dis- 
covering him. From time to time someone appears in Göttingen and insists 
on personal discussion. 

Nearly all “solutions” are written on a very elementary level (using the 
notions of high school mathematics and perhaps some undigested papers in 
number theory), but can nevertheless be very complicated to understand. 
Socially, the senders are often persons with a technical education but a failed 
career who try to find success with a proof of the Fermat problem. I gave 
some of the manuscripts to physicians who diagnosed heavy schizophrenia. 

One condition of Wolfskehl’s last will was that the Akademie had to 
publish the announcement of the prize yearly in the main mathematical 
periodicals. But already after the first years the periodicals refused to print 
the announcement, because they were overflowed by letters and crazy 
manuscripts. So far, the best effect has been had by another regulation of the 
prize: namely, that the interest from the original 100,000 Mark could be used 
by the Akademie. For example, in the 1910s the heads of the Göttingen 
mathematics department (Klein, Hilbert, Minkowski) used this money to 
invite Poincaré to give six lectures in Göttingen. 

Since 1948 however the remainder of the money has not been touched. 

I hope that you can use this information and would be glad to answer any 
further questions. 


Yours sincerely, 
F. Schlichting. 
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LECTURE U 


Recent Results 


Some of the most common questions I have been asked are: 


a. For which exponents is Fermat’s theorem true? 
b. Is serious work still being done on the problem? 
c. Will it be solved soon? 


Anyone over 40, hearing my reply to the first question, will say: “When 
I was younger, we knew that it was true up to...” and will then state some 
rather small exponent. 

Below I will try to present whatever information I have gathered. I will 
not, however, attempt to answer the last question. 

There has always been considerable work done on the subject—though 
of rather diverse quality—so it is necessary to be selective. My purpose is to 
show the various methods of attack, the different techniques involved, and 
to indicate important historical developments. 

Here are 10 recent results which will later be discussed in more detail. 


1. Stating the Results 


1. Wagstaff (1976): Fermat’s last theorem (FLT) holds for every prime 
exponent p < 125000. 

2. Morishima and Gunderson (1948): The first case of FLT holds for every 
prime exponent p < 57 x 10° (or, at worst, as I will explain, for every 
prime exponent p< 3 x 10°, according to Brillhart, Tonascia and 
Weinberger, 1971). 

In fact the first case also holds for larger primes. 


3. The first case of FLT holds for the largest prime known today. 
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The above results are on the optimistic side. But some mathematicians 
think that there might be a counterexample. How large would the smallest 
counterexample have to be for a given exponent p? 


4. Inkeri (1953): Ifthe first case fails for the exponent p, if x, y, z are integers, 
O<x<y<2, py} xyz, x? + y’ = 2”, then 


2p3 p 
S Ken Sa 
log(3p) 


and us Ap 


And in the second case, 


-4 3p-1 


x > pr? 


Moreover, Pérez Cacho proved in 1958 that in the first case, y > 4(p?P + 1)’, 


where P is the product of all primes q # p such that q — 1 divides p — 1. 
There might also be only finitely many solutions. In this respect: 


5. Inkeri and Hyyrö (1964): (a) Given p and M > 0, there exist at most 
finitely many triples (x,y,z), such that 0 < x < y < z, x? + y? = 2?, and 
y=x,z—y<M. 

(b) Given p, there exist at most finitely many triples (x, y,z) such that 
0<x<y< z, x? + y’ = 2, and x is a prime power. 
For each such triple, cf. Inkeri (1976), we have the effective majoration 
(and this is a very important new feature): 


x < y < expexp{[2"(p — 1! De, 
Another sort of result, this time for even exponents is the following: 

6. Terjanian (1977): If x, y, z are nonzero integers, p is an odd prime, and 
x2P + y? = 2?” then 2p divides x or y. In other words, the first case of 
FLT is true for every even exponent. 

The possibility that FLT (or even its first case) holds for infinitely many 
prime exponents is still open. In this respect we have: 

7. Rotkiewicz (1965): If Schinzel’s conjecture on Mersenne numbers is 
true, then there exist infinitely many primes p such that the first case 


of FLT holds for p (Schinzel conjectured that there exist infinitely many 
square-free Mersenne numbers). 
The next results are intimately connected with the class group of the 
cyclotomic fields Q(£), where ¢ is a primitive pth root of 1. 
8. Vandiver (1929): If the second factor h* of the class number of OC) 


is not a multiple of p and if none of the Bernoulli numbers B,,, (n = 
1,2,....(p — 3)/2) is a multiple of p>, then Fermat’s last theorem holds 


for the exponent p. 
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9. Eichler (1965): If the first case fails for p, then p'v?!~! divides the first 
factor h* of the class number of Q(¢) and the p-rank of the ideal class 
group of OU) is greater than ,/p — 2. 

10. Brückner (1975): If the first case fails for p, then the irregularity index of 
p, ii(p) = #{k = 2,4,...,p — 3|p divides the Bernoulli number B,} 
satisfies 


ii(p) > ./p — 2. 


2. Explanations 


Now, I shall explain the significance of these various theorems and 
computations. 


Result (1). Wagstaff obtained his result with a computer, but what is the 
theory behind it? 

Kummer’s theorem asserts that FLT holds for the prime exponents p 
which are regular. A prime p is regular if p does not divide the class number 
h of the cyclotomic field OU), where ¢ is a primitive pth root of 1. Kummer 
showed that this is equivalent to p not dividing the first factor h* of the class 
number. Since the computation of the class number, or even of its first factor, 
is rather involved, and even more because the class number grows so rapidly 
with p, it was imperative to find a more amenable criterion. Kummer charac- 
terized the regular primes p by the condition: 


DY Bu for 2k = 2, 4,...,p— 3. 


Here B,, denotes the 2kth Bernoulli number. These are defined by the formal 
power series expansion 





They may be obtained recursively; moreover if n is odd, n > 3, then B, = 0. 
Vandiver gave a practical criterion to determine whether p is irregular, 
by means of the congruence 


Ap Zb + 3p=2k _ pk _ 1 
4k Bus 3 


P/6<a<p/4 





a**~! (mod p). 


The advantage of this congruence is that it involves a sum of relatively few 
summands, contrary to the previous congruences. If both the right-hand side 
and the left-hand factor of the above congruence are multiples of p then the 
above congruence does not decide the question and other similar congruences 
have to be used. Once it is known that p is irregular, the following criterion is 
used (Vandiver, 1954 and Lehmer, Lehmer, and Vandiver, 1954): 

Let p be an irregular prime, let P = rp + 1 bea prime such that P < p? — p 
and let be an integer such that t # 1 (mod P). If p|B>,, with 2 < 2k < p — 3 
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If Q%, # 1 (mod p) for all 2k such that p|B,,, then FLT holds for the ex- 
ponent p. This criterion is well suited to the computer. 

During his extensive calculations, Wagstaff noted many facts about the 
irregular primes. The maximum irregularity index found was 5. Moreover, 





, l ; 
+ (irregular primes p < 125000) _ 39248 œ 1 — -L = 0.39347. 
+ (primes p < 125000) Je 


This confirms a heuristic prediction of Siegel (1964). 

Let me now recall various interesting results about regular and irregular 
primes. 

It is suspected that there exist infinitely many regular primes, but this 
has never been proved. On the other hand, Jensen proved in 1915 that there 
exist infinitely many irregular primes. Actually they are abundant in the 
following sense. In 1975, Yokoi proved for N an odd prime, and Metsänkylä 
(1976), for arbitrary N > 3, that if H is a proper subgroup of the multiplicative 
group (Z/NZ)*, then there exist infinitely many irregular primes p such that p 
modulo N is not in H. 

Taking N = 12 and letting H be the trivial subgroup, gives the following 
puzzling theorem previously obtained by Metsänkylä (1971): There exist 
infinitely many irregular primes p which satisfy either one of the congruences 
p = 1 (mod 3), p = 1 (mod 4). But he couldn’t decide which of these con- 
gruence classes must contain infinitely many irregular primes. 

So it is rather startling that it is possible—and not too difficult—to show 
that there are infinitely many irregular primes, however, it is not known 
whether there are infinitely many regular ones, even though heuristic argu- 
ments seen to indicate that these are much more numerous. 

Among the many conjectures—and all seem difficult to prove—let me 
mention: 


1. There exist primes with arbitrarily large irregularity index. 
2. There exist infinitely many primes with given irregularity index. 
3. There exists a prime p and some index 2k such that p° | B ,,2 < 2k < p — 3. 


Result (2). The fact that the first case holds for all prime exponents less 
than 3 x 10° depends on the scarcity of primes p satisfying the congruence 
2?-1 = 1 (mod p°). 

Fermat’s little theorem says that if p is a prime and p ¥ m, then më Tel 
(mod p). Hence the quotient q,(m) = (mP”' — 1)/p is an integer. It is called 
the Fermat quotient of p with base m. 
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In 1909 Wieferich proved the following theorem: 


If the first case of FLT fails for the exponent p, then p satisfies the stringent 
condition that 2?~' = 1 (mod p°); or equivalently 9,(2) = 0 (mod p). 


This theorem had a new feature, in that it gives a condition involving only 
the exponent p, and not a possible solution (x,y,z) of Fermat’s equation as in 
most of the previous results. The original proof of Wieferich’s theorem was 
very technical, based on the so-called Kummer congruences for the first case: 

If p xyz and x? + y? + z? = 0, then for 2k = 2, 4,..., p — 3, we have 
the congruences (for a real variable v) 


K log(x + gl 


dv2* 


(as well as the similar congruences for (y,x), (x,z), (2,x), (y,2), (z,y)). These 
congruences were obtained with intricate considerations involving the arith- 
metic of the cyclotomic field and transcendental methods (the latter, as a 
matter of fact, may be replaced by p-adic methods). 

Thus, it suffices to show that 2°”! Æ 1 (mod p°) to guarantee that the first 
case holds for p. For a few years no such p was found. Only in 1913 Meissner 
showed that p = 1093 satisfies 2’~' = 1 (mod p°). The next prime satisfying 
this congruence was discovered by Beeger in 1922; it is p = 3511. Since then, 
computations performed up to 3 x 10° by Brillhart, Tonascia and 
Weinberger (1971) have not found any other such prime. Thus, in the above 
range, the first case holds for all but these two primes. 

The handling of these exceptional primes was actually done by a similar 
eriterion. Indeed, in 1910 Mirimanoff gave another proof of Wieferich’s 
theorem and showed also that if the first case fails for p then 3"! = 1 
(mod p?). The primes p = 1093 and 3511 do not satisfy this congruence. 

Several more criteria of a similar kind were successively obtained by 
various authors. In 1914 Frobenius and Vandiver showed independently 
that 4,6) = 0 (modp) and q,(11)=0 (mod p), if the first case fails for p. 
Successively, Pollaczek, Vandiver, Morishima proved that q,(m) = 0 (mod p) 
must hold for all primes m < 31. Morishima proved the same criterion for 
m = 37, 41, 43 (except for finitely many primes p). The exceptions were ruled 
out by Rosser in 1940 and 1941. However, in 1948 Gunderson pointed out 
that Morishima’s proof was incomplete. I have been assured by Agoh and 
Yamaguchi, who worked with Morishima and studied his papers, that the 
proofs are sound. 

Rosser, Lehmer and Lehmer, using the above criteria (up to m = 43), and 
the Bernoulli polynomials to estimate the number of lattice points in a 
certain simplex in the real vector space of 14 dimensions, gave the following 
well-known bound: 

If the first case fails for p, then p > 252 x 10°. 

These computations have been superseded by the bound 3 x 10° 
obtained using a computer, as I have already indicated. 


x By, = 0 (mod p) 
o 
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Furthermore, Gunderson devised, in 1948, another sharper method to 
bound the exponent. Assuming the Fermat quotient criteria up to 31, this 
gives the bound p > 43 x 108, and up to 43, the bound is p > 57 x 10°. 

Result (3). The largest prime known today! is the Mersenne number 
M, = 24 — 1 where q = 19937. It has 6002 digits. Its primeness was shown 
by Tuckermann in 1971, using the famous Lucas test: if q > 2, M, is prime 
if and only if M, divides S,. The numbers S, are defined by recurrence: 
S, = 4, 8,11 = SŽ — 2, so the sequence is 4, 14, 194,.... 

But how was it possible to show that the first case holds for such a large 
exponent? As a matter of fact, this is a consequence of Wieferich’s and 
analogous criteria, and it is a special case of a result which was proved suc- 
cessively by Mirimanoff, Landau, Vandiver, Spunar, Gottschalk. Namely: 

Suppose that there exists m not divisible by p, such that mp =a + b, 
where the prime factors of a and of b are at most 43 (this depends on the 
Fermat quotient criteria). Then the first case holds for p. Therefore, it holds 
for all Mersenne primes M, = 24 — 1, as well as for many other numbers. 

Do there exist infinitely many prime numbers p satisfying the conditions 
of the preceding proposition? This is an open question. In 1968 Puccioni 
proved: 


If this set of primes is finite, then for all primes | < 43, l £ +1 (mod 8) 
the set M, = {qq is a pime and RT = 1(q°)} is infinite. 


Primes in æ, are very hard to find, but this doesn’t preclude these sets 
being infinite. 

Result (4). The first lower bound for a counterexample to FLT was given 
by Grünert in 1856. He showed that if 0 < x < y < z and x" + y" = 7” then 
x >n. So it is useless to try to find a counterexample with small numbers. 
For example, if n = 101 the numbers involved in any counterexample would 
be least 102°. 

It was easy to improve this lower bound. Based on congruences of 
Carmichael (1913), if x? + y? = z?,0<x<y<z, then x > 6p’. 

But, with some clever manipulations Inkeri arrived at the lower bound 
already given. Taking into account that the first case holds for all prime 
exponents p < 57 x 10°, then 


> 2x 573 x 102° + 57 x 10957% 10° 
x 
log(171 x 10°) 





This is a very large number; it has more than 18 x 101! digits! 


1 Since this book was written, a larger prime M,, with q = 21701 was discovered by two 18-year- 
old students of California State University at Hayward. Laura Nickel and Curt Noll announced 
their discovery on November 15, 1978, and their computations were confirmed by Tuckermann 
(see Los Angeles Times, November 16, 1978, part II, page 1). The search lasted for three years, 
it required 440 computer hours. The new prime has 6533 digits. 
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Similarly, for the second case we may take p = 125000, hence 
x > (125 x 103)3* 125% 10°. 


This number has more than 18 x 10° digits. 

To give some sense of the magnitudes involved, I have inquired about 
some physical constants, as they have been estimated by the physicists. 

For example, the radius of the known universe is estimated to be 1078 cm. 
The radius of the atomic nucleus, about 10713 cm. So the number of nuclei 
that may be packed in the universe, is just about (1078* 13)? = 10??? —a very 
modest number indeed! 

But I should add that the above is rather controversial, and I have quoted 
it only to stress the enormous disparity between the sizes of the candidates 
for a counterexample to FLT, and the reputedly largest physical constants. 

Despite the monstrous size of the numbers involved, it is perhaps not 
quite safe to assert that no counterexample to the theorem will ever be 
available. Consider, for example, the equation 


1211% — 1 = (112 4 DU 10 — 1) 


which is easy to establish. Yet, the numbers involved have more than 10100 
digits. 

This being said, mathematicians had better try to prove FLT, or at least 
some weaker form of it, rather than look for a counterexample. 


Result (5). For example, it might be possible to show that the Fermat 
equation has at most finitely many solutions. It might even be that the 
number of solutions is bounded by an effectively computable bound. I 
should warn however that this has not yet been proved. 

It was only under a further restriction that a finiteness result was proved 
by Inkeri. He considered possible solutions (x,y,z) such that the integers are 
not too far apart, more precisely y — x < M, and z — y < M, where M>0 
Is given in advance. Then the problem becomes actually one of counting 
integer solutions of an equation involving only 2 variables. For this purpose 
there are the theorems of Siegel, or Landau, Roth, or similar ones. Actually 
keri and Hyyrö used the following: Let m, n be integers, max{m,n} > 3. 
et SR) = Gy X" + a, X"! + ‘+ a, € Z[ X], with distinct roots. If a is an 

ger, a + 0, then the equation f(X) = aY” has at most finitely man 
Solutions in integers. ` 

Given this theorem they proved statement (a). 

Concerning (b), I wish to mention that it partially answers a conjecture 
of Abel (1823). Abel conjectured that if x? + y? + z? = 0 (with nonzero 
Integers x, y, z) then, at any rate, x, y, z are not prime powers. I suppose that 
Abel might have had in mind a procedure, which would produce from a 
ofl solution (x, y, 2) another one (x,,y,,2,), where the minimum number 

Prime factors of the integers X, Y4, Zı is strictly smaller than it was for 
x, y, z. In this situation he would “descend” on this number, eventually 
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finding a solution with some prime-power integer—and if this turned out to 
be impossible, he would have proved FLT. 

To date Abel’s conjecture has not been completely settled. Sauer in 1905, 
and Mileikowsky in 1932 obtained some partial results. In 1954 Möller 
proved: 

If x" + y"=2",0<x<y<z, and ifn hasr distinct odd prime factors 
then z, y have at least r + 1 distinct prime factors, while x has at least r such 
factors. Ifn = p is a prime, this tells that y, z cannot be prime-powers. More- 
over, if p does not divide xyz, then x also cannot be a prime-power (this was 
proved by Inkeri in 1946). It remains only to settle the case p|xyz, and to 
show that x is not a prime-power. 

Inkeri has succeeded in proving that there are at most finitely many 
triples (x,y,z), as above, where x is a prime-power. Using the methods of 
Baker, which give effective upper bounds for the integral solutions of certain 
diophantine equations, Inkeri showed (1976), that 

x< y <expexp[2%(p — 1ugs DJe- D?, 

I pause now to indicate another very interesting use of Baker’s estimates. 

The famous Catalan problem is the following: to show that the only 
solution in natural numbers, x, y,m>1,n> 1, of the equation x” — y" = 1 
is x = 3, m=2, y= 2, n = 3. This problem is still open. However, using 
Baker’s methods, Tijdeman determined a number C > 0 such that if (x, y,m,n) 
is a solution then x, y, m, nare less than C. In particular, there are only finitely 
many solutions. 

Closely related is the following conjecture, which is a generalization of a 
theorem of Landau (published in his last book of 1959): 

Let a, <a, e: "` be the increasing sequence of all integers which are 
proper powers De, squares, cubes, etc. ..). Then liMp> (An+ı — A) = ©. 

In his result, Landau considered two fixed exponents m, n and the sequence 
of mth powers and nth powers. 


Result (6). Now I will turn to a more elementary result. 

In his very first paper on Fermat’s problem, published in 1837, Kummer 
considered Fermat’s equation with exponent 2n, where n is odd. And he 
showed that if it has a nontrivial solution, x?" + y?" = 27", with ged(n,xyz) = 1 
then n = 1 (mod 8). 

So, there exist infinitely many primes p such that the first case is true for 
the exponent 2p. 

Kummer’s result was rediscovered several times. It has also been im- 
proved. For example, in 1960 Long showed that if gcd(n,xyz) = 1, x” + y= 
z?" then n= 1 or 49 (mod 120). Some more elementary manipulation 
shows that ifm = 4 or 6 (mod 10) then X” + Y" = Z” cannot have a solution 
(x,y,z) with ged(m,xyz) = 1. But the best possible result dealing with the first 
case, for an even exponent, was just obtained by Terjanian. It plainly states 
that the first case is true for any even exponent. The proof is ingenious, but 
elementary. This leads to the speculation that there might be an elementary 


2. Explanations 27 


proof for the first case and arbitrary prime exponents. I think, however, that 
it shows rather that the equation with prime exponents is far more difficult 
to handle than with even exponents. 

Result (7). Schinzel’s conjecture has been supported by numerical evidence 
To date, no one has ever found a square factor of any Mersenne number. 
Moreover if p? divides a Mersenne number, then p > 9 x 108. l 

Rotkiewicz’s theorem says that Schinzel’s conjecture implies that there 
exist infinitely many primes p such that 2?~'#1 (modp?). Hence b 
Wieferich’s theorem, there would exist infinitely many primes 7 for which 
the first case holds. I believe, however, that a proof of this last statement, and 
a proof of Schinzel’s conjecture are equally difficult. = 


Result (8). To better explain the meaning of Vandiver’s result, it is neces- 
sary to return to Kummer’s monumental theorem: 


If p is a regular prime, then FLT holds for the exponent p. 


As I have already mentioned, Kummer was led to study the arithmetic of 
cyclotomic fields, to take care of the phenomenon of nonunique factorization 
into primes. To recover uniqueness Kummer created the concept of ideal 
numbers. Later Dedekind interpreted these ideal numbers to be essentiall 
what we call today ideals. However, it should be said that Kummer’s ideal 
numbers were in fact today’s divisors. Besides the ideal numbers, he con- 
sidered of course the actual numbers, that is, the elements of the cyclotomic 
field. For the ideal numbers unique factorization holds. Ideal number 
were called equivalent when one was the product of the other by an E 
PERE Kummer showed that the number of equivalence classes is finite—it 
called the class number of the cyclotomic field and usually denoted by h. 


1 
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1= ISCH 

a= ft XT Ie 
so 5, is a real positive unit of OC) Let V be the subgroup of ue generated by 
all these (p — 3)/2 “circular” units. Kummer showed that h* =(U*:V), the 

index of V in U*. en 

Moreover, he proved that if p does not divide h*, then p does not divide 
h*. Therefore p is a regular prime if and only if p does not divide h*. Then, 
he proceeded to compute h* for all primes p < 163 and he found the following 
irregular primes p = 37, 59, 67, 101, 103, 131, 149, 157. Based on his com- 
putations, he conjectured that the first factor h* = h*(p) of the class number 
is asymptotic to 


(p — 1)/2, let 








p-1)/4 
oa 219 = 2p (4) 


This conjecture, which agrees with recent numerical evidence, has yet to be 
proved. In 1951, Ankeny and Chowla proved that 


logh*(p) = log y(p) + olog p) = (p + 3)logp — 3p log 2x + ollogp). 
Later, in 1964, Siegel published his weaker result: 
log h*(p) = log y(p) + o(p log log p). 


It follows that log h*(p) ~ 4p log p. l 
In 1976, Masley and Montgomery showed that if p > 200, then 


(27) Ring" 3 < h*(p) < (27) P/2 pP t314 
and Pajunen has shown also in 1976 that if 5 < p < 641, then 
3y(p) < h*(p) < Zap) 


Concerning the growth of the first factor, Ankeny and Chowla proved in 
1951 that there exists pọ such that h*(p) is monotonically increasing for 
p > po. It is conjectured by Lepistö that one may take Po = 19. l 

The second factor is much more difficult to handle, since it is tied to the 
structure of the group of units. It was Kummer who already found the first 
example, p = 163, for which h*(p) is even. However not many more examples 
were known before 1965, when Ankeny, Chowla, and Hasse, using a lemma of 
Davenport and class field theory, proved: if q is a prime, n> 1, and p= 
(2qn} + 1 is a prime, then h*(p) > 2. l l À 

If p|h*(p) but pi h*(p) the cyclotomic field is called properly m we 
It is improperly irregular if p|h°(p) and so p|h*(p). It is not known w fi er 
there are improperly irregular cyclotomic fields. At any rate, none has een 
found for p < 125000. Vandiver, Pollaczek, Denes, and Morishima have 

ied i fields. 
ae (8) finds its origin in Kommerz work of 1857, where 
Kummer considered in depth the first case for irregular primes. Vandiver 
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analyzed the work of Kummer, corrected mistakes, filled gaps and was able 
to generalize it to include the second case also. A paper written by Vandiver 
in 1934 contains the claim that if p does not divide the second factor, then 
the first case of FLT holds for p. However, his proof is now considered as 
questionable. 


Result (9). The result of Eichler is beautiful and far reaching, and the 
method used relies on basic principles, rather than on the previous criteria. 
Actually, Eichler proved that if the first case fails for p then pl“?! divides h*. 
The other assertion may be proved in the same way. 

Without knowing this variant of Eichler’s theorem, Skula proved that if 
the p-rank of the ideal class group of Q(¢) is 1, that is, if the p-class group is 
cyclic, then the first case holds for p. A simpler proof was given by Brückner. 
But, this is contained already in Eichler’s theorem. 


Result (10). To explain the scope of the latest of Briickner’s theorems, let 
T be the ideal class group of Q(¢), h the class number. If p is irregular, then 
T + pr so y, = dim(T/pr) = 1 (where I’/pT is considered as a vector space 
over the field with p elements). In 1965, Eichler proved that if the first case 
fails, then y, > yp — 2. However, the computation of yp is difficult. Briickner 
succeeded in relating the above dimension y, to the irregularity index ii(p), 
and proved that if the first case fails for p, then more than Vp — 2 Bernoulli 
numbers B,, (with 2 < 2k < p — 3) are multiples of p. 

This fits into a series of classical results. Cauchy- (1847) and Genocchi 
(1852) proved that if the first case fails for p, then P. is a multiple of p. 
In 1857, Kummer showed that both B,_; and B,; must be multiples of p. 
Later, Mirimanoff showed that B,- and B,_, must also be multiples of p. 

In 1934 Krasner proved quite an interesting result: there exists a prime 
Po (which could be effectively computed) such that if p > Po and if the first 
case of FLT fails for p, then the k Bernoulli numbers Ra Bn By-(2x+1) 
are all multiples of p; in this statement k = [x/logp]]. Thus, in the event of 
a solution in the first case of FLT a reasonably large number of successive 
Betnoulli numbers would be multiples of p. Even though this number is 
usually smaller than the one indicated by Brückner’s theorem, in this case 
the Bernoulli numbers are consecutive. This is a most unlikely conclusion, 


perhaps pointing to the fact that the first case of Fermat’s theorem may very 
well be true. 
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LECTURE III 


B.K. = Before Kummer 


In this lecture, I wish to report various early attempts to solve Fermat’s 
problem. I begin by considering the case of exponent 2, which is much earlier 
than F ermat's time. As Zassenhaus kindly pointed out to me, 2 is the oddest 
of the primes. Among its special properties, this oddest of all the primes is 
even; it is also the only exponent for which it is known that the Fermat 
equation has a nontrivial solution. 

Then, I will give Fermat’s famous proof by infinite descent for the case 
of fourth powers. After that, I will present Euler’s and Gauss’s proof for the 
case of cubes, as well as sketch proofs for other exponents. 

Other attempts were not restricted to specific exponents. Noteworthy— 
though not successful—were the contributions of Cauchy and Lame. Barlow 
and Abel found interesting relations which must be satisfied by any possible 
solution of Fermat’s equation. Sophie Germain proved a clever and beautiful 
theorem for the first case. 

Since I have already given some of the early history of the problem in 
Lecture I, I will now limit myself to the technical details of the theorems 

All these methods, devised before Kummer, have a certain naiveté Ñ 
common. Normally, they use only properties of the rational numbers 
However elementary they may have been, they didn’t lack ingenuity. On the 
contrary, they were often very tricky. | 

Before entering into the details, I wish once more to recall: 
ae first case of Fermat’s theorem holds for the prime exponent p > 2 
3 5 ee Ge exist integers x, y, z, such that p does not divide xyz and 
. The second case holds for p > 2 when there do not exist relatively prime 
integers x, y, z, such that x? + y? + z? = 0 and p divides x, y, Or Z. 
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In order to prove Fermat’s theorem for all exponents n > 3, it is sufficient 
to prove it for the exponent 4 and for all prime exponents p > 3. Because of 
this, I will mainly be interested in prime exponents. 

Another obvious though still general remark, which I will not repeat is 
the following. If there exists a solution for Fermat’s equation, 


x" + y" + 2" =0, 


we may assume without loss of generality that gcd(x, y,z) = 1 and thus x, y, Z 
must be pairwise relatively prime. 


1. The Pythagorean Equation 


If x, y, z are nonzero integers satisfying 
KFZ, (1.1) 


z|. These numbers are the lengths of the sides of a right- 





then so are |x|, el, 
angled triangle. 
To determine all nontrivial integer solutions of (1.1) it suffices to determine 


the so-called primitive (Pythagorean) triples (x,y,z): 
x, y,z > 0, ged(x,y,z) = 1, x even. 
All the other solutions are obtained by changing signs, permuting x, y, 


and by multiplication with some nonzero integer. e 
The following theorem gives a complete description of all the primitive 


triples: 


(1A) If a, b are integers, a>b>0, gcd(a,b) = 1, a, b not of the same 
parity, let 


x = 2ab, 
y =a’? — B’, (1.2) 
z=a’* +b’. 


Then (x,y,z) is a primitive triple. And conversely, every primitive triple may be 
so obtained. 


Distinct pairs (a,b) give rise to distinct primitive triples and these may all 
be obtained from some pair (a,b). For example, the smallest primitive triples, 
ordered according to increasing values of z are as follows: 

(4,3,5) (12,5,13), (8,15,17), 
(24,7,25), (20,21,29), (12,35,37). 


Listing ofthe primitive triples amounts to determining the representations 
of odd positive integers as sums of two squares. Fermat proved, in this 
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context, the well-known theorem: 


(1B) n > 0 is a sum of two squares of integers if and only if every prime 
factor p of n, such that p = 3 (mod 4), appears to an even power in the fac- 
torization of n into prime factors. 


There remains then the question of finding the number of representations 
as sums of two squares. 

Let r(n) denote the number of pairs (a,b) of integers (not necessarily 
positive) such that n = a? + b?. For example, r(1) = 4 and r(5) = 8. The 
determination of r(n) was done by Jacobi and by Gauss, independently: 


(1C) r(n) = 4(d,(n) — dein 
where 


d,(n) = #{d|1<d,d|n, d = 1 (mod 4)}, 
d3(n) = #{d|1<d, d|n, d = 3 (mod 4)}. 


With this information, it is possible to determine all the primitive Pythag- 
orean triples. Clear proofs of these beautiful theorems may be found in the 
book of Hardy and Wright. 


2. The Biquadratic Equation 


Here is Fermat’s proof using the method of infinite descent. The idea is the 
following: assume that (X9,Vo,Zo) is one solution in nonzero integers, then 
there is another solution (x;,y1,2,) of the same kind, with 0 < |x,| < |xo|. 
Since this procedure may be repeated indefinitely, one would obtain an 
infinite decreasing sequence of positive integers |xo| > |x,| zz: > 0— 
which is absurd. So, there couldn’t be any solution in nonzero integers. 


(2A) The equation 

X*+Y*=Z? (2.1) 

has no solution in nonzero inte ] 
gers. In particular, the i 

EE same is true for the 

X* + ¥4= Zi. (2.2) 
Proor. Let (x,y,z) be a triple of positive integers satisfying (2.1). It is easy to 
See that we may assume, without loss of generality that gcd(x, y,z)= 1. We 


may also assume that x is even. Then (x”,y?,z) i imiti 
f ‘ ‚y‘,z) is a primitive P 
en p ythagorean 
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By (1A) there exist integers a, b such that a> b > 0, gcd(a,b) = 1, a, b 
have different parity, and 


x? = 2ab, 
y=a—b’, (2.3) 
z=a? +b’. 


As easily seen, b must be even. E 
Since b? + y? = a? and ged(b,y,a) = 1, by (1A) there exist integers c, d 
such that c > d > 0, gcd(c,d) = 1, c, d of different parity, and 


b = 2cd, 
y=- 8, (2.4) 
a= +g. 
Hence 
2 = 2ab = 4cd(c? + a’). (2.5) 


Since c, d, c? +d? are pairwise relatively prime, from (2.5) using the 
uniqueness of factorization into primes, we conclude that c, dc + d° are 
squares of positive integers: 


c =e’, 
d= f?, (2.6) 
+a =g. 
Hence 
et + ft =9?, (2.7) 


that is, the triple (e, f,g) is a solution of (2.1). ae 
But z = a? + b? = (c? + d?)? + 4c?d? > g* > g > 0. By infinite descent, 
oO 


this leads to a contradiction. 


Various equations of 4th degree, similar to (2.1) may be treated with this 
methods. I quote the following theorems: 


(2B) The following equations have no solution in nonzero integers: 


Xt- Yt = +2’, (2.8) 
X*44Y*=Z*, (Euler) (2.9) 
Xt —4Yt = +Z’. (2.10) 


Also Legendre proved: 
(2C) If x, y, z are nonzero integers: 
If x* + y* = 222, then x? = y? and 2? = ae (2.11) 
If 2x* + 2y* = 22, then x? = y?, and 27 = Ar, (2.12) 
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3. The Cubic Equation 


In my first lecture, I mentioned that the first published proof of Fermat’s 
theorem for the case of cubes is due to Euler. It appears in Euler’s Algebra 
published in St. Petersburg in 1770. This book was posthumously translated 
into German in 1802 and into English in 1822. An important step in Euler’s 
proof, which used divisibility properties of integers of the form a? + 3b, was 
done without sufficient justification. Legendre, who reproduced Euler’s 
proof in his book (1830) did not give any further explanations. Since he was 
himself also an expert on such matters, he had certainly understood Euler’s 
reasoning. However, later mathematicians were less comfortable about the 
possible gap. In 1894, Schumacher pointed it out explicitly. The gap was 
again the object of comments by Landau (1901), a paper by Holden (1906) 
and a note by Welsch (1910). Quite recently in 1966, Bergmann published a 
thorough analysis of Euler’s proof, with historical considerations, throwing 
more light on this controversy. Indeed, already in 1760, in his paper Supple- 
mentum quorundum ...supponuntur, published in Novi commentarii aca- 
demiae scientiarum Petropolitanae (also in Opera Omnia, series prima, II, 
pages 556-575), Euler rigorously proved that if s is odd and si = a? + 3b?, 
with gcd(a,b) = 1, then s = u? + 3v?, with u, v integers. 

Another proof of Fermat’s theorem for cubes was given by Gauss, and 
published posthumously. Both proofs use the method of infinite descent. 
However, while Euler worked with integers of the form a? + 3b?, Gauss used 
complex algebraic numbers of the form a + b,/—3. 


(3A) The equation 
X34 ¥2+Z3=0 (3.1) 
has no solution in nonzero integers. 
PROOF. Suppose that x, y, z are pairwise relatively prime integers such that 
x? + y? + 23 = 0, with x, y odd, z even and |z| is the smallest possible. This 
may be assumed without loss of generality. Then 
xX + y= 2a, 


BR ER (3.2) 


_ where a, b are relatively prime nonzero integers of different parity. Therefore 


=2 =x? + y? = (a+ bP + (a — b}? = 2a(a? + 362). (3.3) 
It follows easily that a? + 3b? is odd, 8 divides 2a, b is odd and 
gcd(2a, a? + 3b”) = 1 or 3. 
Case 1. gcd(2a, a? + 3b?) = 1. 
From (3.3), 2a and a? + 3b? are cubes: 
2a=r,, 


a? + 3b? = 5? (3.4) 
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where s is odd. At this point, according to Euler, it is possible to write 
s = u? + 3v?, with integers u, v such that 


= u(u? — 9), 
5 5 (3.5) 
b = 3v(u* — v’). 
Then v is odd, u # 0, u is even, 3./u, ged(u,v) = 1, and 
r? = 2a = 2u(u — 3v)(u + 3v). (3.6) 


Note that 2u, u — 3v, u + 3v have to be pairwise relatively prime, so they 
are cubes of integers: 


Qu= — Ë’, 
u — 3v =m’, (3.7) 
u + 3v =n’, 
with |, m, n not equal to zero (since 3 does not divide u). Thus 
| Bım+n=0 (3.8) 
with | even. Moreover, since b # 0, 3 Yu: 
z| = |2a(a? + 3b?)| = Pu? — 9v2)(a? + 3b2)| > 3|P| > Pl. 
This contradicts the minimality of |z]. 
Case 2. gcd(2a, a? + 3b?) = 3. 
Let a = 3c, then 4|c, 3b and 
— 23 = 6c(9c? + 3b?) = 18c(3c? + b’), (3.9) 


where 18c, 3c? + b? are relatively prime, 3c? + b? is odd, and not a multiple 


of 3. 
Then 18c, 3c? + b? are cubes of integers: 


18c =P, 

3c? + b? = 8°, 

with s odd. By the same step as in Case 1, s = u? + 3v?, with integers u, v 
such that 


(3.10) 


b = u(u? — Or 
c = w(u? — v?). 


Thus u is odd, v is even, v # 0, ged(u,v) = 1, and 2v, u + v, u — v are pairwise 
relatively prime. From 


(3.11) 


3 
G) = 2v(u + cn — v) (3.12) 
it follows that 
2v= —P, 
utv=m, (3.13) 


u-v=—n. 
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5 
Hence H + mi + ni = 0 with I, m, n nonzero integers, | even. Finally 
|z|? = 18|c|(3c? + b?) = 54|v(u? — v?)|(3c? + b?) 
= 27|I|?|u? — v?|(3c? + b?) > 225 > WS. 
This contradicts the choice of |z| minimal. o 


To justify the step concerning the expression of s as s = u? + 3v?, it is 
necessary to study more deeply the quadratic form x? + 3y?. This is Siemens 
tary but a bit too long to explain in detail. Therefore I will only summarize 
the main points; details were given, for example, by Carmichael in 1915 


Let S = fa? 2 ; EE f ; 
ee {a* + 3b*|a, b integers}. The following is a list of various prop- 





(3B) S is closed under multiplication: 
(a? + 3b?)(c? + 3d?) = (ac + 3bd)? + 3(ad — be)? 
= (ac — 3bd)? + 3(ad + be}. (3.14) 


(3C) Let p be a prime and n> 1. If pe S and pne S, then neS. 
(3D) A prime p belongs to S if and only if p = 3 or p = 1 (mod 6). 
GE) If m = a? + 3b? € S, with ged(a,b) = 1 and if n|m, then ne S. 
This gives a complete description of S: 


H m is in S if and only if the following condition is satisfied: if p is a prime 
p = —1 (mod 6), or p = 2, then the exact power of p dividing m is even. 


e Ger If p is a prime number, p = a? + 3b? = c? +38, then a? = c?, 


SE Let m,, m, be produors of primes belonging to S, let m = mım,. For 
representation m = u* + 3v? there are representations 


m, =a? + 3b?, 

Ge 2 (3.15) 
such that Zn 

u = ac — 3bd, 

v = ad + be. (3.16) 
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And finally: 


(31) Let s be an odd integer, such that s? = u? + 30?, with nonzero integers 
u, v. Then s = t? + 3w?, with t, w integers and 
u = t(t? — 9w?), 


e (3.17) 


The above property was the one needed by Euler. 

Now I turn to Gauss’s proof. As a matter of fact, as I hope to explain, 
it does more than prove Fermat’s last theorem for exponent 3. 

Gauss worked with complex numbers of the form a + bt, where a, b are 
integers, and €=(—1 + J-3IR2 is a primitive cubic root of 1. Let A be the 
set of all such numbers. Using modern language, A is a ring, namely the ring 
of algebraic integers of the imaginary quadratic field K = Q(./—3). Every 
(nonzero) element in A whose inverse is again in A is called a unit of A. The 
units of A are known to be: 1, —1, č, —¢, (7, —(?; they are all roots of unity. 

If a, ße A, a divides ß when there exists y € A such that ay = ß; this is 
written «|B. If «|8 and fla then « and f are associated; « ~ ß. This happens 
only when « = fy where y is a unit. 

ae A is a prime element if the only elements of A dividing « are either 
associated with a or units. There are “enough” prime elements, so that every 
a e A may be written as a product of a unit and powers of prime elements. 
For the rings in question, such a factorization is essentially unique: if 


a= one ni = ont ne“ with œ, œw units, $ > 0, s > 0, z; distinct 
prime elements, e; > 1, 77) distinct prime elements, e; > 1, then s = s’, and up 
to a permutation of indices m; ~ Tis.. -> ns ~ Tl, Cy = Chr. ês = Eg 


Since there is unique factorization into primes, it is possible to define the 
greatest common divisor of elements of A—they are unique, up to multiplica- 
tion by units. Elements of A are relatively prime if their greatest common 
divisors are units. 

An element which plays an important role is A=1-{= (3—./—3)/2. 
A is a prime element and 3 ~ 2’. 

If a, B, ye A, then «= H (mody) means that y|« — H This congruence 
relation satisfies properties analogous to ordinary congruence. 

I note that since 3 ~ 42, if a = ß (mod 2), then ai = ß? (mod 2’). 

There are exactly three congruence classes modulo 4, namely the classes 
of 0, 1 and —1. 

The following congruence was needed in Gauss proof: 


Lemma. If ae A and } X a, then «è = +1 (mod 2$). 
The proof is straightforward. And now, Gauss’s theorem: 


(3J) The equation 
X? + Y? + Z=0 (3.18) 


has no solutions in algebraic integers a, P, y € A all different from 0. 
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PROOF. Suppose the theorem is false. Dividing by a greatest common divisor 
there is no loss of generality in assuming that there exists «, ByeA relatively 
prime, such that a? + B? + y? =0. It follows that o, ß, y are also pairwise 
relatively prime. Hence it may be assumed that 4 Fa 4Y B. 


Case l. AF 


er classes of a, ß, y are therefore those of 1 or —1.So« 
mod å), hence «? = +1 (mod 3). Similarly, 8? = + 3) vi 
(mod 4°). So R EC 


O-aiiBi +y = +1+1 + 1(mod4’). 


The combinations of signs give + 1 or +3. Clearl 
l + +3, y0# +1 (mod 4°); also 
if 0 = +3 (mod 2°), then 43| +3 ~ +42 hence 2 SCH 
SE [+3~+ nce A| +1 and A would be a unit, 
Case 2. A|y. 

Let y = 4"6 with n > 1,146, ô e A hence 


a + B? + 13"53 =0 (3.19) 
with «, B,d€ A,n > 1. 


So the following property (P,) is satisfied. 


(Pa): There exist o, P, 8 € A such that AY a, AX B44 6, a B are relatively 
prime and o, P, ô are a solution of an equation of the form 


X? + Y? + oA”? = 0, (3.20) 

where o is a unit (in (3.19), œ = 1). 
The idea of the proof is the following: to show that if (P,) is satisfied, then 
n> 2 and (D. Ais also satisfied. Repeating this procedure, eventually (P,) 


would be satisfied, which is a icti is i 
\ í À contradiction. This is nothing bu 
infinite descent (on the exponent n). en 


So there remain two steps in the proof. 


Step 1 . If (P,) is satisfied, then n > 2. 
Since Aya, 148 by the lemma a’ = +1 (mod 4*), 8? = +1 (mod 4%) 


and +1 +1 = - win? (mod 191 with 4 
Eher ith A 6. The left-h i 
since À Y +2, hence 3n > 4 and n > 2. d EHS 


Step 2. If (P,) is satisfied, then (P„-ı) is also satisfied 
By hypothesis: ) 
— sun 2 AH cae 
wA"? = a? + B? = (a + Bla + Cpa + CRL (3.21) 


The prime element A must divide i 
N 2 a one of the factors in the right-hand side. 
ow a+ B=at (B=a+C?B (mod}) because 1 = t = C? (mod Al i 3 


divides each factor. Hence 
a+ BP att at lB 
RE A 
A 
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and 
ai a+b a+ 0p 
A A ak, 7 


From n > 2 (by the first step) A divides one of the factors in the right-hand 
side. It is easily seen that «+ ß, « + CB, a + CR are pairwise incongruent 
modulo 22. Hence å divides only one of the factors of right-hand side of (3.22). 
For example A divides (x + ß)/A (the other cases are analogous, replacing D 
by ¢f or £?ß, which is permissible). 

So >" =D divides (o + ß)/A. Hence 


a + B = 13-4, 


(3.22) 





viii D83 = 


a+ CB = Ako, (3.23) 
a+ (7B = kz, 
with Ki, K2, K3 € A and A does not divide Ky, K2, Ka. 
Multiplying: 
te — 06? = KıKaka. (3.24) 


It is easy to see that Kaka, K3 are pairwise relatively prime. Since the ring A 
has unique factorization, K1, K2, K, are associated with cubes 


Ky =i 
K3 = N2093, (3.25) 
K3 = 13035 


where y; are units, gy; € A (i = 1,2,3), G1, P2, 3 are pairwise relatively prime, 

and A does not divide 1, @2, #3. Then 

a + = np), 
a + CB = A203, (3.26) 
a + CB = Ans@3. 
From 1 + ¢ + ¢? = 0 it follows that 
O = (æ + p) + Cla + CB) + O(a + 7B) 
= 2,01 T SIE + C7An393 


SO 
p3 + 793 + =, (3.27) 


where 1, € are units, @1, 2, 93 € A are not multiples of A, and os, @3 are 


relatively prime. 
Ift = 1, then 2, @3, p, is a solution of 


X? 4 Y? pr OZ? = 0. (3.28) 
Ift = —1, then p2, — 3, 9, is sucha solution. 
It remains to show that t # +¢, +¢7. Indeed, since n > 2, 
p3 + tọ? = 0 (mod 4°). (3.29) 
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But by the lemma 
= +1 (mod 24), ois +1 (mod Ai 
( 


hence +1 + t = 0 (mod 4°). 
However +1+¢=0 (mod}?) and +1 +? =0 (mod}?). So t # ti 
LC and this establishes the propery (P„-1), concluding the proof. O 





Various other cubic or sextic equations were treated by similar methods: 


(SK) The following equations have no solutions in nonzero integers: 


X?+4Y°’= 1, (3.30) 
AT 27Y° = 22°, (3.31) 
16X° — 27Y° = Z?, (3.32) 
X?’+Y?=32°. (3.33) 


4. The Quintic Equation 


In 1825, Dirichlet read at the Academy of Sciences of Paris a paper where 
he claimed to have proved Fermat’s theorem for the exponent 5. However. 
he neglected to consider one of the possible cases. In the meantime, Legendre 
independently found a complete proof, while Dirichlet was settling the 
remaining case to finish his proof. 

Essentially, Dirichlet’s proof uses facts about the arithmetic of the field 
K= au). It would take too long to describe the proof in detail. It suffices 
to say that it proceeds by considering two cases separately. The first case is 
quite easy. The second case was treated by infinite descent. 

The following lemma is the basic technical tool in the proof: 


Lemma. Let a, b be relativel i ; : i 
5¥a,5 |b. If y prime nonzero integers of different parity, 
e 5 
em (1) (e5) get 
H 2 \ 


(with e = 0, f, g integers of the same parity) then there exist relatively prime 
integers c, d, of difierent parity, 5 Fe, such that 





a = c(c* + 50c?d? + 125d*), 


b = 5d(c* + 10c?d? + 5d*). Ee 

A similar lemma is necessary if (a? — 5b?)/4 is of the form (4.1). These 
lemmas reflect the unique factorization (up to units) of elements into products 
of prime elements, which is valid in K = Q(/5). 
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5. Fermat’s Equation of Degree Seven 


In 1839, Lamé proved Fermat’s theorem for the exponent 7. He had been 
preceeded by Dirichlet, who proved the theorem for the exponent 14 in 
1832. It should be emphasized that the proof for 14 was substantially easier 
than for 7. Obviously, Dirichlet would have been happier to discover a proof 


for the exponent 7. 
In 1840, Lebesgue found a much simpler proof than Lame’s. It used the 


following polynomial identity. 


(X+Y¥+Z)’ —(X74+ Y74+Z’) 
=X + YY + ZUZ + X)[(X? + Y? + Z? + XY + YZ + ZXY 
+ XYZ(X+Y+Z)] (5.1) 
which was used already by Lame. 
Cauchy and Liouville, while reporting on Lamé’s paper of 1839, indicated 
other general polynomial identities. If p is a prime, p > 3, then 
(X +1)? — X?-—1=pX(X + 1)(X? + X + 1)°G,(X), (5.2) 
where 
S 1 when p= —1 (mod 6), 
© 2 when p = 1 (mod 6), 
and G,(X) is a polynomial with integral coefficients, not a multiple of 
X?4+X+1. 
The identity used by Lamé in 1840 was: If m is odd, then 
(X+Y+Z”"-(X+Y-Z"-(X - Y+Zy"—(-X+Y+Z)" 


(m — 1)! 2ay2b72 
= x2ey22zZ2 (5.3) 
7 Kä (2a + 1)'(2b + 1)'(2e + 1)! 


a+b+c=(m-3)/2 





With such complicated tools, Lamé published a “proof” of Fermat’s 
theorem for arbitary exponent. As I explained in my first lecture, this was a 
fiasco, because Lamé made unjustified use of what amounts to unique 
factorization in the ring of cyclotomic integers (generated by the nth roots of 
1)—and this is not generally valid. 

For the exponent 7 the main steps in Lebesgue’s proof are the following: 


(a) If x, y, z are pairwise relatively prime nonzero integers such that 
x’ + y +27 =0 by (5.1): 


s’ = Tut, (5.4) 
where 
sex+y+tz 
u = x? + y? +27 + xy + xZ + YZ, (5.5) 


v = (x + yy + z)\(z + x), 
t = u? + xyzs. 


S 
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(b) Then v £ 0, s # 0, v and s are even, u is odd, t = 1 (mod 4), gcd 
H > a ot = > t, SS 
1 ged(t,v) = 1. GE 
(c) tis the 14th power of an integer and 7 kt Let t = ai? 
} . =qg'",qlusou= gr. 
(d) v = 7°p’ with p even, hence | = 


(x + yx + 2y + 2) = 7%", (5.6) 

(x? + y? +2? + xy + yz + 2x)? + xyzs = A (5.7) 
x+y+z=7pg?, (5.8) 

X? +y? + zZ xy tx +yz = qr. (5.9) 





; 172 
(e) Putting rg roi =a, q? = b, pi =2"*1¢ and manipulating with 
the above relations yields 


a? = bt — 22m x 3 x T4h2e? + 2Hmtl) x Tiet (5.10) 


where a, b, c are odd and relatively prime. 
(f) The proof is concluded by showing that (5.10) is impossible. This is 


done by induction on m. As a matter of fact, this step i 
i » is actually | 
the rest of the proof. P y longer than 


Summarizin g [have tried to describe, at least in part, the proof of Lebesgue 
(already simpler than Lamé’s) so as to convey the idea of how involved the 
proofs were becoming, as the exponent increased. 

Such elementary methods are not of the same level of difficulty as the 
problem. But, by this time, there were already many other attempts, which 
which T'I] describe in the following lectures. 
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LECTURE IV 


The Naive Approach 


In this lecture, I will relate what has been done with Fermat’s problem 
without using any sophisticated methods. Let me say, that these attempts 
should not be looked down on. On the contrary, they show much ingenuity, 
and they have helped to understand the intrinsic difficulties of the problem. 
DI point out, in various cases, how these attempts have brought to light 
quite a number of other interesting, perhaps more difficult problems than 
Fermat’s. 

If I have decided to group these various results under the heading of 
“the naive approach,” it is only because Fermat’s problem has proved itself 
to be at another level. In fact, it is possible that all other approaches tried 
as yet may someday be considered naive. Who knows? 

I will group the various results according to the main ideas involved, 
rather than chronologically. 


1. The Relations of Barlow and Abel 


A natural thought in trying to prove Fermat’s theorem is to assume that 
there exist integers x, y, z, different from 0 (not multiples of p in the first case) 
and satisfying the Fermat equation 


x? + yP + 2? =0, (1.1) 


where p is an odd prime, and then, though whatever manipulations our fancy 
dictates, to derive relations involving these numbers x, y, z, and p. In this 
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way, the idea is to aim to reach a contradiction, thereby proving that (1.1) 
was impossible. 
Since 


=z = xP + y? = (x + ya! — xPT2y + xP? 2 _...—xyP2 4 yPp-1), 
(1.2) 


(x? + y?)/(x + y) is an integer and it is certainly of importance to study this 
expression and its divisibility properties. 
From the very beginning, attention was given to the study of 


n-1 


Q,(a,b) = 3. air") (1.3) 


k=0 
for n > 1, a, b nonzero integers. 
Ifa + b #0, then Q,(a,b) = (a" — (—b)")/(a + b) and if moreover n is odd 
then 
a" + b" 


a+b Ka 





Q,(a,b) = 


Results of this kind are scattered through the literature. I prefer to list 
them together here, even though some of the properties won’t be required 
until much later. The proofs are exercises. To avoid repetition, in what 
follows, a, b will be nonzero relatively prime integers, n, m > 1 and p is a 
prime. 


Lemma 1.1 (Birkhoff and Vandiver 1904; also Inkeri, 1946). If pla" +b" 
but p X a” + b" (for every proper divisor m of n), then p = 1 (mod n). 


Lemma 1.2 (Möller, 1955; see also Inkeri, 1946 and Vivanti, 1947, for 
parts (4), (5), (6)). 


1. If gcd(n,m) = d, then gcd(Q,(a,b), Q,,(a,b)) = Q.(a,b). 
2. [[pin Q,(a,b) divides Q,(a,b). 
3. If nis odd, n > 3 and a > b, then Q,(a,b) > n. 


The equality holds only when: a= 1, b= —1; or n = 3, a = 2, b = 1; or 
n =3,a = —1;b = —2. 


4. gcd(Q,(a,b), a + b) = ged(n,a + b). 

. If p #2 and p*||a + b (with e > 1), then pert are b. 

. If p #2 and p\a + b, then p° ¥ Q (a,b). 

. Ifp #2, p*||a+b,e > 1 and n = p'm, p% m, r 20, then petla — (— bY. 
. If every prime factor of n divides a + b, then n(a + b) divides a" — (— bY". 


CO JA Ch Lë 


In one form or another, these lemmas are the background of several 
results to be mentioned. 
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Barlow (famous for the Barlow tables), discovered in 1810 the following 
relations; these were found later in 1823 by Abel, who mentioned them in a 
letter to Holmboe. 


(1A) If x, y, z are pairwise relatively prime (nonzero) integers, satisfying 
xP + y?” + z? = 0 and if p # 2 does not divide z, then there exist integers t, t, 
such that 


xP + y? 
x+y 
Moreover, p X tt,, gcd(t,t,) = 1, t, is odd and t, > 1. 


x+y=P??, 





=t}, 2= thy: (1.5) 


PROOF. Since x + y +z =x’ + y? + z’ = 0 (mod p), it follows that —z = 
x + y (mod p), so pYx + y. 
Clearly 
(—2)? =x? + y? = Q,(x,y) “(x + 9). (1.6) 


By Lemma 1.2, part (4), the two factors Q,(x,y), x + y are relatively prime. 
By unique factorization, they are pth powers. 
The rest of the proof is even easier. O 


So, if pairwise relatively prime integers x, y, z are not multiples of p and 
satisfy (1.1), then the Barlow-Abel relations 











xP + yP 
x+y=ť, SE z= -tt 
8 ye + 2? 
yrz=", y+z =ri, =, (1.7) 
zP + xP 
z+x=sP, Re: = si, y= —SS,, 


are satisfied. 
In the above expressions the integers r, s, t, r,, $4, tį are not multiples of 
P, 1, Sy, ty are odd, ged(t,t,) = gcd(r,r,) = ged(s,s,) = 1, and gcd(r,s,t) = 
Bed(r,5),t) = 1. ere ere eRe 
(1.7) may be rewritten as 


r? P P —yP P p 
x=-r+ wissen 











2 2 ? 
rP + SPHE Pepe 
= — SÉ ~ 
EE EES (1.8) 
z= pE ESEE E 
a en 


Abel had also indicated relations like the above, wh 
5 en ‚I 
with (1A), Abel stated (1823): SE 


54 IV The Naive Approach 


(1B) If x, y, z are pairwise relatively prime nonzero integers satisfying (1.1), 
if p #2 divides z, then there exist integers n> 2, t, r, S, ti, 71, Sı such that 











ee yap, SE = D, z= —p"th, 
P + oP 
y+z=r?, ES =r], x= if, (1.9) 
5 EE 
z+x=sP, en = si, y = —SSy. 


As before, r, s, t, 71, S4, t, Satisfy the same properties already indicated. 


Proor. In view of (1A) the relations of the last two lines of (1.9) must hold. 
By hypothesis x + y = — z = 0 (mod p). Writing u = x + y = p""'r' with 
m> 2, př ť and writing 
x? +y XE (ux) 
x+y u 





Q(x y) = 


= uP“! — puP-2x + Gs AL pe 


then by Lemma 1.2, gcd(x + y,Q,(xy)) = p. So Q,(x,y) = pt, p4{ ti. From 
(—2)? = Q(x, y) (x + y) = p't't 


it follows that m is a multiple of p, m = pn, leading to the required relations. 
The other assertions are immediate, except that n > 2 still needs to be 
proved. 
From 


ytz=r, z+x=# 


it follows that 
(r +s) =r +s? =y+x+2z=0(modp). 


So r = —s (mod p), hence r? = —s? (mod p°). 
By (1.9) 
22 = 1? + 5? — EI = 0 (mod p”) 


showing that n 2 2. = 


2. Sophie Germain 


Sophie Germain was a French mathematician, a contemporary of Cauchy 
and Legendre, with whom she corresponded. Her theorem, brought by 
Legendre to the attention of the illustrious members of the Institut de 
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France, was greeted with great admiration. I want to indicate how beautiful 
and neat is her result. Its core is the following proposition: 


(2A) Let p, q be distinct odd primes, satisfying the following conditions: 


1. p is not congruent to the pth power of an integer modulo q. 
2. If x, y, z are integers and if 


x? + yP + 2? = 0 (mod q), (2.1) 
then q divides x, y or z. 
Then the first case of Fermat’s theorem is true for the exponent p. 


Proof. Assume the contrary: x, y, z are pairwise relatively prime integers 
such that x? + y? + z? =0. A fortiori, from hypothesis (2), g divides one 
(and only one) of the integers x, y, z. Say, q|x, q 4 yz. By Relations (1.8) 


—r? + s? + t = 2x = 0 (mod q). 


By hypothesis (2) q divides r, s, or t. As easily seen, g|r, qX st. So the 
following congruences hold (by (1.7)): 


y = —2(mod q); 
t? = yP ~! (mod gq) because (x + y)t? = x? + y’; 
P + zP 


+2 





r? = pt? (mod q), because r? e = yP GE de EE E ke ECKE DEE 
= bf "zs pt? (modą). 


Noting that q Fr, and letting ¢’ be such that vr, = 1 (mod q), then (t'r,)? = 
p (mod al, This contradicts hypothesis (1). O 


Sophie Germain’s celebrated theorem of 1823 is the following: 


(2B) If p is an odd prime such that 2p + 1 is also a prime, then the first case 
of Fermat’s theorem holds for p. 


Proor. It suffices to check that the primes p and q = 2p + 1 satisfy the 
hypotheses of (2A), which is quite easy. 
If p = a? (mod q), computing the Legendre symbol yields 


a 
+1= C = a972? = a? = p(modg) 


SO p = +1 (mod al, and this is impossible. 
Next, suppose x’ + y?” + z? = 0 (mod q) and q 4 xyz. Since p = (q — 1)/2, 
the little Fermat theorem implies that 
x? = +1 (mod q), 
y? = +1 (modą), 
z = +1 (modą). 


Nl 
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So 0 = xP + y? +z? = +1 +1 + 1 (modg), which is again impossible. 
And that is the proof! O 


Various comments are in order. 

Such a nice result inspired at once several generalizations. Indeed, using 
the same idea, though with a somewhat more detailed analysis, Legendre 
could prove the following result: 


OCH The first case of Fermat’s theorem holds for the odd prime exponent p, 
provided one of the following numbers is also a prime: 4p + 1, 8p + 1, 10p + 1, 
14p + 1, 16p + 1. 


With this theorem, Sophie Germain and Legendre covered all the primes 
p < 100 and thereby established the first case for these primes. Though it 
was only the first case, still it represented a considerable advance over 
previous attempts, even more so because it was proved as early as 1823. 

The method however has its limitations. It is difficult to extend it for 
primes p such that 2kp + 1 is prime, when k is large. This may be seen, 
examining the proofs given by Legendre. Secondly, the method does not 
work for the second case of Fermat’s theorem. 

More interesting is the problem it leads to. Given k > 1, what can be 
said about the set of primes p such that 2hp + 1 is also prime, for some h, 
1<h<k? 

More specifically, are there infinitely many Sophie Germain primes p 
(those such that 2p + 1 is also a prime)? This question is of the same order 
of difficulty as the well-known “twin prime” problem. 

I wish to state that the answer is positive, provided one assumes the 
following hypothesis H of Schinzel (1958): 

“Let s > 1, let f,(x),...,f,(X) be irreducible polynomials with integral 
coefficients; assume that the leading coefficient of f,(X) is positive and that 
no integer n> 1 divides all the numbers f,(m)f,(m)::- f,(m). Then there 
exists one (and it may be proved, necessarily infinitely many) natural num- 
ber(s) m such that f,(m), Tim... f(m) are all primes.” 

For example, taking f(x) = X, DO) = 2X + 1 this hypothesis would 
imply that there exist infinitely many Sophie Germain primes. Taking 
f(X) = X, f(X) = X + 2, it yields infinitely many twin primes. As a matter 
of fact, when the polynomials are linear, the hypothesis was made by Dickson 
(1904). 

For the case where m = 1, f,(X) = aX + b, with relatively prime integers 
a, b, the answer is positive and given by the famous Dirichlet theorem on 
primes in arithmetic progression. This is in fact, the only case for which the 
answer is known. 

It appears to be extremely difficult to prove hypothesis H, but just as 
difficult to disprove it—a matter for deep thought. 

Bateman and Horn, in 1962, looked at Schinzel’s hypothesis from the 
quantitative point of view. With a heuristic argument, they made plausible 
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that 
lim Q(N) = œ, 


N>% 
where Q(N) is the number of primes p < N such that 2p + 1 is also prime. 

Another result in this connection was given by Vaughan in 1973. Namely, 
the two statements (H,) and (H2) below cannot be simultaneously true: 

(H,) there exist an infinity of primes p such that 8p + 1 is a prime. 

(H2) for infinitely many n, d(n) = d(n + 1) [where d(n) denotes the number 
of positive divisions of n]. 

There have been more recent developments by Krasner (1940) and 
Dénes (1951). Krasner’s approach, as well as Dénes’s, are not completely 
naive. Indeed, Krasner appeals to a theorem of Furtwängler (which requires 
class field theory), while Dénes uses some results about cyclotomic fields and 
estimates for the size of the least exponent p for which the first case might 
be false. Here is Krasner’s theorem: 


(2D) Assume that p is an odd prime, and h is an integer such that: 
1. q = 2hp + 1 is a prime, 
2. 34h, 
3. 3#2 < 2hp + 1, 
4. 27" 4 1 (mod q). 
Then the first case of Fermat’s theorem holds for p. 
In 1951 Dénes proved: 


(2E) If p is an odd prime, if h is an integer, not a multiple of 3, h < 55 and 
ae that q = 2hp + 1 is a prime, then the first case of Fermat’s theorem holds 
or p. 


3. Congruences 


The idea now will be to derive congruences which must be verified by nonzero 
Pairwise relatively prime integers x, y, z satisfying 
xP + yP 4+ zP = 0, (3.1) 


The first result is due to Fleck (1909). It was repeatedly rediscovered by 
Frobenius (1914), Vandiver (1914), and Pomey (1923). [To his credit, I must 
add that Vandiver also proved a corresponding result valid for the second 
case; however, since it uses the methods of class field theory, I will postpone 
giving it until later. ] 


(3A) With the above assumptions on x, y, and z, if p does not divide x, then 


x?! = 1 (mod p°). 
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This is a rather stringent congruence, since by Fermat’s little theorem, 
it is only expected that x?~' = 1 (mod p). The following lemma, due to 
Sophie Germain is required for the proof (for notation, see §1). 


Lemma 3.1. If p does not divide x, y, z, then r, = 1 (mod p°), s, = 1 (mod p°), 
tı = 1 (mod p°). 


Proor (following Pérez-Cacho, 1958). I prove that r, = 1 (mod p?). It suffices 
to show that if q is any prime dividing r,, then q = 1 (mod p°). 

From alrı it follows that q|x, qi yz. Since gcd (r,r,) = 1, we have 
q¥ y +z. q|x implies that sł = z?~' (mod q) and tł = y?~* (mod q) so 
0 = yP + z? = yt} + zs? (mod d that is, —yt{ = zs} (mod q). But q|r; so 
qly? + zP,andq 4 y + z. By Lemma 1.1, q = 1 (mod p). Hence, raising to the 
power (q — 1)/p: 2471”? = — y4~)? (mod q). 

But if y’ is such that yy = —1 (mod q), then from z? = —y? (mod 1) it 
follows that (zy’)? = 1 (mod.g). So, from z# —y (modg), the order of zy’ 
modulo q is equal to p. 

But (zy’)@~ +” = 1 (mod p) so p divides (q — 1)/p, that is, q = 1 (mod p°), 
as it was required to show. ð 


Using this lemma, Theorem (3A) is proved as follows: 


Case 1. pX xyz. 





Then r, = 1 (modp?). Hence x = —rr; = —r (mod p?) and x? = —r? 
(mod p°). 

By symmetry 

y? = ab (mod p?) and z? = —tť (mod pî). 
Hence 
r? + sP + t? = 0 (mod p°). (3.2) 
From (1.8), x = — r” (mod p°) so 
x =x?(modp*) and x?~! = 1 (mod p°). 

Case 2. p|z, p X xy. l , , 

I omit the proof, since it is similar. It requires a lemma like 3.1, which 
asserts that if p|z then t; = 1 (mod p°). o 


The quantities r, s, t satisfy various divisibility properties. Vandiver (1925) 
and Inkeri (1946) are responsible for the following result: 


(3B) 
1. If pX xyz, then x + y + z is a multiple of rap" andr + s + t is a multiple 


of p°. ; 
2. If p|xyz, then x + y + z is a multiple of rstp? but r + s + t is not a multiple 


of p. 
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In particular, in the second case, r + s + t #0. 


But this is true in general, as shown by Spunar (1929) and again by James 
(1934), Segal (1938): 


BGC r+s+t#0. 


The next group of results involves congruences of the following type: 

(1 + x)” = 1 + x?” (mod p™*?), (3.3) 

It is convenient before proceeding, to state the following lemma of 
Ferentinou-Nicolacopoulou (1965): 


Lemma 3.2. If p is an odd prime and p does not divide a, nor a + 1, if m > 2, 
then (a + 1)?” = a?” + 1 (mod p"*) if and only if (a + Uer =a?" 41 
(mod p™*?), 


The special case where m = 2 had been noted by G. D. Birkhoff (see 
Carmichael, 1913). 


The next lemma was proved by Carmichael for m = 1 in 1913; Klösgen 
proved it in general (1970). 


Lemma 3.3. Let p be an odd prime, m > 1. Then the following conditions 
are equivalent: 
a. There exist x, y, z, not multiples of p, such that 
xP" + yP” + 22" = 0 (mod p™*?), (3.4) 
b. There exists a, 1 < a < (p — 3)/2 such that 
1+ a?" = (1 + a)?” (mod p™*?). (3.5) 
I omit the proofs of these lemmas, since they don’t offer much difficulty. 


In particular, if the first case fails for the exponent p, there exist x, y, z, not 
multiples of p, satisfying (3.1). Then 


x? + y? + z? = 0 (mod p°), 
hence 


1 + a? = (1 + a)? (mod p?). (3.6) 


But Carmichael and Meissner proved in 1913 and 1914 the more precise 
congruences: 


(3D) Let (3.1) be satisfied. Then 
1. If p X xyz, then there exists a, 1 < a < (p — 3)/2, such that 
1+ a? = (1 + a)? (mod p°). (3.7) 
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2. If p|xyz, then there exists a, 1 < a < (p — 3)/2 such that 
1+ a? = (1 + a)’ (mod p°). (3.8) 


Proor. I give only a proof of the first assertion. The other is handled in a 
similar way. By (3A) x? = x (mod p°), so x” = x? = x (mod p°). Similarly 
y” = y (mod p°), z” = z (mod p°). Hence by (3B): 


xP? 4 y” 4 cP? = x + y + z= O (mod p’). 


By the preceding lemmas, there exists a, 1 < a < (p — 3)/2 such that 
(1 + a)?’ = 1 + a” (mod p?) hence (1 + a)? = 1 + a” (mod p°). o 


The congruence (3.7) may be investigated by computers. Wagstaff verified 
(in 1975) that for every prime p < 10°, p = —1 (mod 6), the congruence 
(1 + X}? = 1 + Kr (mod p?) has no solution a, such that 1 < a < (p — 3)/2. 
This shows that the first case holds for such exponents p (I stated in Lecture 
II that the first case is known to hold for p < 3 x 10°). 

In 1950, Trypanis announced without proof the following strengthening 
of Carmichael’s result; this was rediscovered (and proved) by Ferentinou- 
Nicolacopoulou in 1965: 


(3E) If p > 5 and the first case fails for p, there exists a, 1 < a < (p — 5)/2 
such that 
(1 +a)” = 1 + a” (mod p*). (3.9) 


It is easily seen that (3.9) implies (3.7). A recent result of Gandhi (1976) 
follows at once from the above. 

Wells Johnson (1977) investigated whether the congruences of Carmichael 
and Trypanis may be further extended, modulo every power p"*? (n= 1). 
This is a situation best handled with p-adic methods. 

Let x; denote the unique p-adic integer such that ai ' = 1 and a, =j 
(mod p) (where j = 1,2,...,p — 1). The existence and uniqueness of o. follows 
from Hensel’s lemma. 


GF) If p is a prime and p¥ a, p{a + 1 then the following conditions are 
equivalent: 


a. (1 + a)?" = 1 + a” (mod p"*?) for every n 2 1. 
EE BEE TEEN 
c. a? +a+1=0(modp). 


It is worth noting that already in 1839 Cauchy had shown that (c) 
implies that (1 + a)? = 1 + a? (mod p>). In essence this is his proof: Since 
a? +a+1=0(mod p), — 3 is a square modulo p, so p = 1 (mod 6). 

But Cauchy had established the identity, when p = 1 (mod 6): 


(+X 1- X’ = REI + X + X + XIX), (3.10) 
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where fi (X ) is a polynomial with integral coefficients. For X = a this gives 
the required congruence. 


It would be interesting to know whether the following statement is true: 


(S) If a is an integer, 1 <a < p — 2, and if (1+ a)? = 1 + a? (mod p°) 
then a? + a + 1 = 0 (mod p). ' 

Assume that (S) is true. Then the first case of Fermat’s theorem would 
hold for every prime exponent p = 5 (mod 6). 

Otherwise, by (3D), there exists an a such that 1 + a? = (1 + a}? (mod p°), 
hence by (S) a? + a + 1 = 0 (mod p) and therefore p = 1 (mod6), a con- 
tradiction. 

A strengthening of (S) is false, as shown by Arwin in 1920; there exist 
integers a and a prime p such that (1 + a)? = 1 + a? (mod p°) and yet 
a? + a+ 1 #0 (modp). 


4. Wendt’s Theorem 


In 1894, Wendt gave a criterion for the first case, involving a certain matrix 
with binomial coefficients. But, because of the size of the matrix, his criterion 
is quite awkward. Even worse, it was later recognized, that it is essentially 
equivalent to Sophie Germaine theorem. Yet, TI] present it now, because 
of other interesting connections. 

For n > 2 let W, be the determinant of the n x n matrix: 


e OOG 
W, = det L ULI (41) 


n n n\ 
1 2 3 1 
This is a circulant. For example, W, = — 3, W = 2? x 7, W, = —3 x 53 


W = —3’ x 5° x 17°. Stern explicitly computed this determinant in 1871. 
The general result is the following: 


If f(X)=ay +a,X + +a 1X"! the circulant with first row 


(49,41, ... ,dy—4) is equal to []?22 TEL where 
„15 Hn- i= Us = 1, er. — 
nth roots of 1. I ° " Si r nza are the 


Since (1 + XY — X" =1 + (DX + (X27 + + EES) 


n-1 
w= Ea +e 1] (4.2) 


62 IV The Naive Approach 


The first result leading towards Wendt’s theorem is the following: 


(4A) Let p # 2, let q = 2hp + 1(h 2 1) be primes. Then there exist integers 
x, y, Z such that q ¥ xyz and 


xP + y? + 2? = 0 (mod4q) (4.3) 
if and only if al Wr, 


And now, Wendt’s theorem: 


(4B) Let p #2, let q = 2hp + 1 (h> 1) be primes. If q ¥ Wzn and p” # 1 
(mod q), then the first case of Fermat’s theorem holds for the exponent p. 


I wish to say only a few words about the proof. The fact that q ¥ W2, 
implies the congruence has only the trivial solution, may be established with 
routine arguments using linear algebra over the field with q elements. 

Moreover, if p = r” (mod q) then p*" = r??" = r17! = 1 (mod q), contrary 
to the hypothesis. Thus, the hypotheses of Sophie Germain’s theorem (2A) 
are satisfied and so the first case holds for p. 

Another corollary of Sophie Germain’s theorem was given by Vandiver 
(1926): 


(4C) If p is an odd prime, if q = 2hp + 1 is a prime, if qX Wzn, and 2h = 
2°p*, where p kr, k = 0, then the first case of Fermat’s theorem holds for p. 


For example, this provides a new proof of Legendre’s results for primes 
p such that 4p + 1, or 8p + 1, or 16p + 1 is also a prime. 

It is not known whether for every prime p there exists h > 1 such that 
q= 2hp + 1 is a prime and q ¥ W,,. This is a difficult problem. 

To be more specific, in 1909 Dickson proved that for every prime p there 
exists an integer qo such that if q is any prime, d > qo, then there exist integers 
x, y, z, not multiplies of q, such that x? + yP +z” =0 (mod q). By (4A), this 
means that q| Wz, for every prime q, q > qo, q = 2hp + 1. So, there are at 
most finitely many primes q of the form q = 2hp + 1 for which q 7 W,,. What 
is not known is whether one such prime q actually exists. 

I shall return to Dickson’s theorem in one of the later lectures. 

To conclude the discussion of Wendt’s determinant, I present several of 
its divisibility properties. 


(4D) 


1. If nis even, then W, = —(2" — 1)u?, where u is an integer. 
2. If d|n, then W,| W,- 
3. W, = 0 if and only if 6|n. 
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Less routine and more interesting are the next results. 
Emma Lehmer proved in 1935: 


(4E) If p is an odd prime then W,_, is a multiple of p?~? q,(2), where 


SI) 


2 
2 = —, 
p(2) P (4.4) 


The number q,(2) is an integer (by Fermat’s little theorem) called the 
Fermat quotient (with base 2). Pll return often to the Fermat quotient, in 
connection with various important theorems. 

In particular, Wieferich’s theorem of 1909 (to which I referred in Lecture 
II) states that if the first case fails for p, then q,(2) = 0 (mod p). Asa corollary: 


(4F) If pP"! EH, then the first case of Fermat's theorem is true for the 
exponent p. 


It is important, therefore, to determine the residue of W,_, modulo p”~ +. 
Carlitz proved in 1959: 


(4G) 


p-2 
W,-1= T] [0 +a -a — 1] 
a=1 


p-2 /p-1 a 
=p? I ( ALUS 2) (mod p?~*), 


s=1 
From this congruence, he was able to improve E. Lehmer’s result (1960): 


(4H) If p?*°? ER. then the first case of Fermat’s theorem is true for 
the exponent p. 


5. Abel’s Conjecture 


In 1823, Abel conjectured that if n > 2 and if x, y, z are nonzero pairwise 
relatively prime integers such that 


x" y= 2", (5.1) 
then none of x, y, z can be prime-powers. 
I have explained in my second lecture what may have been the reason 


for Abel’s conjecture. Perhaps he had in mind a descent on the number of 
prime factors of a would-be solution, but he left no written explanation. 
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This conjecture is still unsettled. To be more precise, only a very particular 
assertion still needs to be established. To fix notation, lt 0 < x < y < zZ. 

In 1884, Jonquiéres proved that y is not a prime, and if x is prime, then 
z-y=1. 

Let me just pause to say that up to now the possibility that there exist 
numbers x, y, n for which 0 < x < y and x" + y" = (y + 1)" cannot yet be 
ruled out. 

Jonquiéres’ result was rediscovered by Gambioli in 1901. 

Lucas proved more in 1891: y, z have at least two prime factors. He also 
had an alleged proof that x is not a prime power, however in 1895, Markoff 
pointed out a gap. Again, in 1905, Sauer rediscovered the result of Lucas. 

Mileikowsky gave a new proof of Lucas’s theorem in 1932. He has also 
shown that if n is not a prime, then x cannot be a prime-power. 

All these partial results are contained in the following recent result of 
Möller (1955). 


(5A) Let m > 1 be an odd integer with r distinct prime factors, let u > 0. 
Let 0<x<y be relatively prime integers and a=y’”"+x°””, b= 
vim — x2"™, Then: 


1. aand b have at least r distinct prime factors. 

2. If a has exactly r distinct prime factors, then r = 1 and a= 23 + 13 (so 
u = 0,m = 3). 

3. If b has exactly r distinct prime factors, then b = (x + 1)" — x" (so u = 0, 
y=x41). 


Proor. I sketch the proof of the first assertion. This is a place where I need 
Lemma 1.2. Replacing x, y by x7", vi, vi", it may be assumed that u = 0. Let 
Pi,.--, p, be the distinct prime factors of m. Since y + x # 0, by Lemma 1.2 


y" +x" = (y +x): Qay, +) 


is a multiple of (y + x) [];-ı Q,,(y.£). The integers Q,,(y,+x) are also 
pairwise relatively prime, so a, b have at least r distinct prime factors. 
The other two assertions again need Lemma 1.2 for their proof. 


As a corollary: 
(5B) If n is an integer with r distinct odd prime factors, if OÒ < x < y <z 


are pairwise relatively prime integers such that (5.1) holds, then: 


1. z, y have at least r + 1 distinct prime factors. 
2. x has at least r distinct prime factors; moreover, if x has exactly r distinct 
prime factors, then n is odd and z = y + 1. 


Proor. Let n = 2m with u > 0. By (5A) z" = x" + y" and y" = z" — x" have 
at least r + 1 distinct prime factors (note that n 4 3 and z — x > 1). 
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Similarly, from x" = z" — y", x has at least r + 1 distinct prime factors, 
except when z = y + 1 and n is odd, in which case, it can only be said that x 
has at least r distinct prime factors. o 


And now, more explicitly. 


(SC) Let n> 2, let O< x< y< z be relatively prime integers satisfying 
(5.1). Then 


1. z, y are not prime powers. 
2. If x is a prime power, then z = y + 1 and n is an odd prime. 


PROOF. 


(1) If z is a prime power, so is z" = x" + y". By the preceding result, n is a 
power of 2, n > 4, and this contradicts Fermat’s theorem, which is true for 
such exponents. The same argument is enough to show that y is not a prime 
power. 

(2) If x is a prime power, by (5B), n = p°, e > 1, p an odd prime and 
z= y + 1. It remains to show that e = 1. 

Assume e > 1 so zP°' — y?*"' > 1 and 


AË = z _ MÉI 
= (Gë D VE" "(ze Dee? A ee Ze ++ vie (5.2) 


Hence p does not divide 


ee” < p 
z — y = (y +1)” — yr = >} ( po 
Zei J 
because p divides all but the last summand. So x?“ is the power of a prime 
q # p. Hence q divides both factors of (5.2), that is, 


ype = PI 


' (mod al and (5.3) 
Eidel Leift ze we +. + ylP Derm! = py? De" = 0 (mod q) 

Thus q divides y and therefore by (5.3), q divides z = y + 1, a contradiction. 
o 


With more refined, but still elementary methods, Inkeri proved in 1946 
that if0< x< y< z, xX” + y’ = 2’, and p% xyz, then z — y > 1 and so x is 
not a prime-power. TU return to this question in my lecture on estimates. 


6. Fermat’s Equation with Even Exponent 


As I shall indicate, it is possible to prove the first case of Fermat’s theorem 
for even exponents. Clearly, it suffices to consider the exponents 2p, where p 
is an odd prime. 
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The first result in this connection was obtained by Kummer, in 1837. It 
is his first paper on Fermat’s equation and it is written in Latin. Later, this 
result was rediscovered many times (Niedermeier, 1943; Griselle, 1953; 
Oeconomu, 1956). 

The best theorem concerning the exponent 2p was published by Terjanian, 
in December 1977. It is indeed quite surprising that his proof, which requires 
only very elementary considerations, was not found beforehand. ll not 
jump to the conclusion that perhaps there is also a simple proof of Fermat’s 
theorem awaiting to be discovered. I would rather say that Terjanian’s result 
shows that the first case of Fermat’s theorem for an even exponent is far 
easier than for a prime exponent. 

I begin with Kummer’s theorem: 


(6A) Let n > 1 be an odd integer. If there exist nonzero integers x, y, z such 
that x?" + y?" = z” and ged(n,xyz) = 1, then n = 1 (mod 8). 
Proor. It is possible to take x, y, z positive and relatively prime. A simple 
observation tells that x may be assumed even, while y, z are odd. Write 
gen SH yn 


(6.1) 


x?" az gon PER Vin SS (z? = vi x = — 7 
and observe that if the two factors on the right are relatively prime, then they 
are 2nth powers. 

z? — y? is even, and 

e e 2(n-1) | -2(n-2),,2 2(n-1) 

pop” +2 dë EE EE (6.2) 
is a sum of n odd summands, hence it is odd, therefore of the form k?”, with 
k odd. 

Each summand on the right is of the form (2a + 1} = 4a(a+1)+1=1 
(mod 8). i 

Thus (6.2) becomes an equality of the form 


8b + 1 = (8a, +1) +--+: + (8a, + 1). 
Therefore n = 1 (mod 8). O 
For example, the first case of Fermat’s theorem holds for 2n = 14. 


Kummer’s theorem was extended by Grey in 1954 and by Long in 1960. 
Just for the record, I quote one of Long’s results: 


(6B) If n is an integer whose last digit (in decimal notation) is 4 or 6, and 
if x, y, z are nonzero integers such that x" + y" = z", then gcd(n,xyz) > 2. 


Now I shall give the proof of Terjanian’s theorem, which contains all the 
above results as corollaries. Once more, as in §1, it is question of the quotient 
z—y" 


Q,(z,— y) = z—y > 
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U mn are nonzero relatively prime integers, n odd, n > 3, let (Œ) denote 
the Jacobi symbol defined by (1 = 1 when m is a square modulo n and 
Œ) = —1 otherwise. 


Lemma 6.1. Let y, z be distinct nonzero integers. 
1. Ifm=ng+r,0<r<n<m, then 
Qm(2, =Y) = ZQ", — y")O,(z, — y) + y", — y). 
2. [fm=ng—1r,0<r<n<m, then 
Om(2, =y) = [2"~"Q,- 12", ~y") + y"~"]Q,(2, - y) yT O, — y) 


3. If z, y are odd, relatively prime, z = y (mod 4) and m is odd, then Q,,(z,— y) = 
m (mod 4), so Q,,(z,— y) is odd. 


4. If z, y are odd, relatively prime, z = y (mod 4) and m and n are odd natural 


numbers, then 
(G-A 2 (" 
May) " 
Proor. The assertions (1) and (2) follow at once from the definitions. 
(3) Let z = y + 4t. Then 


+ 4)" — 
Onl2,—y) = HIT. Een + (le free 


= my"! = m (mod 4), 


because m — 1 is even, y is odd, so y""! = 1 (mod 4). 
(4) The assertion is proved by induction on m + n. It is trivial when 
m=n=I1. Lem+n>2. 


If m >n, then there exist an integer r, odd, 0 < r < n, and q such that 
m=qn +r orm = qn ~r. 
If m = qn + r, then m — r is even, so by (1) and induction 


(>) zZ Ke = >) r ar m 
Q,(2,— y) Q,(z, — y) Q,(z,— y) u n > (=) 
Ifm = qn — r, then m — n and n — r are even, so by (2) and induction 
(ge) = >) = —Q,(z,—y) 
ÄER ~ y) Q,(z, Ke y) u WË Sek y) 
elt 
Qaz =y) Q=) Los, all 


By (3), Q,(z,y) = n (mod 4). If n = TP, pf, then it is easy to check that 
n—1=)i-,(p; — le; (mod 4), so 


(=) =|] (=) G (= (KI: @1- ier — (Us. 
i=1 

















Pi 
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Since Q,(z,y) = n (mod 4), 





Q,2,-y)-1_n-1 
; = —— (mod 2) 


(ec) 5 (=): 
(iso) EOE 


Now, if m < n, by Jacobi’s reciprocity law and the above proof 


SI = (— Diet "91" D HOn =D >) 
0,(2,—y) Oml2,— Y) 
Ym-1)%n-1 n 
= (—1) ( ) 2¢ 1 
- (2) = 
n 


After this lemma, Terjanian’s result follows almost at once: 


hence 





(6C) Let p be an odd prime. If x,y,z are nonzero integers such that 
x?P + y?P = z?P, then 2p divides x or y. 
Proor. There is no loss of generality in assuming that x, y, z are pairwise 
relatively prime. Also x, y cannot be both odd, since this would imply that 
x?” = y? = | (mod 4) and hence that z?” = 2 (mod 4), which is impossible. 
Let x be even, so y, z are odd. Then 


zP — vi 
x20 = z? — Eee a ze 
By Lemma 1.2 
2p __ vip 
SIE — Ya) =porl. 


Ifthe greatest common divisor is p, then p divides CR. so 2p divides x. 

I show now that it is not possible that z? — y? and (z?? — y??)/(z? — y?)are 
relatively prime. If they are, both must be squares. But z? = y? (mod4). 
Since p is not a square, there exists a prime d such that p is not a square 
modulo q. It follows from Lemma 6.1 that 


1 (?) — EE 
NR q Q,(2?,- y’) ` 


which is absurd, because Q,(z”, — y?) is a square. 





This concludes the proof. g 
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7. Odds and Ends 


In this final section, I have grouped results of many different kinds. I can see 
by these deductions, that their authors enjoyed toying with Fermat’s equation 
and were fully aware that their contributions would not solve the problem. 
Yet, not only are the proofs sometimes rather elegant or simple, but some 
of these results have been rediscovered and used in more substantial ways. 

I start with a few divisibility properties. Massoutié and Pomey proved in 
1931 the following result: 


(7A) If p= -1 (mod 6) and x, y, z are non-zero integers such that 
xP + y? + z? = 0, then 3 divides x, y, or z. 


In 1934, Pomey claimed also to have proved a similar result for all p > 2 
and also with 5, instead of 3. According to A. Brauer (1934) his argument was 
erroneous. Inkeri proved in 1946 the weaker statement for 5: 


(7B) If p > 2, p # 1, 9 (mod 20) and if x, y, z are nonzero integers such that 
xP + y? + z? = Q, then 5 divides x, y, or z. 


There are various other partial results of this kind in the literature, but 
I shall refrain from giving them. 
Other curiosities follow. Swistak proved in 1969: 


(7C) If p > 2, if x, y, z are positive pairwise relatively prime integers, and 
xP + y? = z”, then p divides (x), p(y), and o(z) (where o denotes Euler’s 
totient function). 


Goldziher proved the next result in 1913. It was rediscovered by Mihaljinec 
in 1952 and Rameswar Rao in 1969: 


(7D) Positive integers x, y, z in arithmetic progression cannot satisfy the 
equation x" + y" = z" (with n > 2). 


Rameswar Rao proved also: 


(TE) If n > 2 is an odd integer and if x, y, z are positive integers such that 
x" + y" = 2", then u(x + y) =0 (u denotes the Möbius function). 


Here are several statements which are equivalent to Fermat’s theorem. 
They were given by Pérez-Cacho in 1946. The equivalence between conditions 
(1), (2) and (3) was first proved by Bendz in 1901, and was rediscovered by 
S. Chowla in 1978. 


(TF) Let m >.2,n = 2m — 1. Then the following statements are equivalent: 


1. The equation X" + Y" = Z" has only the trivial solutions in Z. 
2. The equation X(1 + X) = T” has only the trivial solutions in Q. 
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The equation X? = 4Y” + 1 has only the trivial solutions in Q. 

The equation X? = Y"*! — 4Y has only the trivial solutions in Q. 

For every non-zero rational number a, the polynomial Z 2 — a"Z+a 

is irreducible over Q. 

The equation (XY)" = X + Y has only the trivial solutions in Q. 

The equation X™ = (X/Y) + Y has only the trivial solutions in Q. 

. If uy, r are nonzero rational numbers, if uy, U2, . . . is a geometric progres- 
sion of ratio r, then u? — u, + r #0. ~ _ 

9. If A is a triangle with vertices A, B, C, if the angle CAB = 90°, if HN =2, 

and if |AB| + |BC| is a nth power of a rational number, then |AC| is not 

rational. 


Lä P Loi 


oma 


Moreover, these conditions imply: 


10. The tangents to the parabola Y? = 4X at every rational point distinct from 
the origin, cut the curve Y = X” at irrational points. 


In conclusion, I would like to mention a nice paper of Hurwitz (1908). 
He considered the diophantine equation 


xmy" + Y™Z" + Z™X" = 0, 


where m > n, gcd(m,n) = 1 (without loss of generality), and he proved: 


(7G) The above equation has the trivial solution alone if and only if Fermat's 
theorem is true for the exponent m? — mn + n. 


For example, letting m = 3, n = 1 it follows that 
X7Y + ¥°Z4+ EN 
has only the trivial solution. 
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LECTURE V 


Kummer’s Monument 


This lecture is about Kummer’s famous theorem: Fermat’s last theorem is 
true for every exponent p which is a regular prime. I will explain how Kummer 
arrived at the notion of a regular prime and show why his approach to the 
problem may be considered quite natural. In some sense, it forces itself on us. 


1. A Justification of Kummer’s Method 


In my first Lecture, I already gave a historical survey of what was known 
about the problem circa 1835-1845. 

To show the difficulties one might encounter and to show how natural 
Kummer s approach was, I wish to make some preliminary remarks. The 
first idea which has been used, time and time again, is to write 


Z2 = xP + y” (1.1) 


then to express x? + y” as a product of factors which are pairwise relatively 
Prime and therefore must themselves be pth powers, since integers factor 
uniquely into primes. 

For p = 3 this was done by Gauss (see Lecture III) using cubic roots of 1. 
It will work in general: 


XP + yP = (x + yx + EAX + Cy) Lei DEI (1.2) 
Here € represents a pth root of 1, namely, the complex number 
ma... An 
C= cos" + ising. (1.3) 
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It is necessary to be sure that there are precisely p pth roots of 1, all 
being powers of ¢, and that 


1+ C+ 07 EE 0? 4 =0. (1.4) 


To follow the main idea, it is necessary to give some sense to the statement 
that the various factors x + y, x + Cy, x + C7y,...x + (Py are pairwise 
relatively prime. This requires an a priori notion of divisibility. In such a 
theory, a quantity is an integer exactly when it is a multiple of 1. So “integers 
must be defined. Since the only complex numbers which are relevant have 
to do with ¢ and its powers, and since the ordinary operations of addition, 
subtraction, multiplication, and division have to be allowed, Kummer was 
led to consider those complex numbers which are obtained from ¢ and the 
rational numbers using the above operations. These numbers have the form 


Ap Laf + ant? +++ Lë af `, (1.5) 


A summand involving CP "1 is unnecessary, since it may be expressed by 
means of (1.4) in terms of lower powers of ¢. 

The totality of such numbers is the field Q(¢). Every element satisfies an 
algebraic equation over Q, of degree at most (and in fact dividing) p — 1. 

The elements of the form (1.5) where each a; is an integer, are called the 
cyclotomic integers of OU) They form a ring Z[{]. Each ordinary integer is 
a cyclotomic integer. 

If x, ße OCL a divides B when there exists y e Z[¢] such that «y = ß. One 
writes a|. l l 

The notions of divisibility of cyclotomic integers and ordinary integers 
share various properties. 

However, and this is the main point, there are at least two major differences. 
First, there are cyclotomic integers (not equal to 1 or — 1), which divide 1. 
For example, the numbers 1, — 1, ¢ — 4, 07, —¢?, .. OT, —¢P 71. In fact, 
I will show that there are many others. From the point of view of divisibility, 
they play no role, so they are called units, and their union is the multiplicative 
group of units. 7 

Suppose two nonzero elements «, ß are such that « divides ß and B divides 
a. In this case a, are called associated elements and the notation « ~ $ is 
used. It is obvious that a, ß are associated if and only if «/f is a unit. 

An example is the following: 


1-Er1-0 whenl<kAj<p-1. 


Indeed, let j be such that jj’ = k (mod p). Then 





S ON ip gip. p UDI eza. 
101-0 E E E & eZ[C] 


In the same way (1 — ©/)/(1 — ¢*) e Z[C]. 
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The existence of units forces an indeterminacy in describing elements with 
given divisibility. Lumping together all the elements with the same behavior 
is therefore a reasonable idea. 

Following the model of ordinary arithmetic, a cyclotomic integer is 
prime if the only cyclotomic integers which divide it are either associated to 
it, or units. So essentially primes have no proper divisor. 

It is not at all difficult to show that every cyclotomic integer is a product 
of primes. However, it can be shown that it is false, in general, that if « is a 
prime and « divides fy then a | Bore | y- It is known that if this property were 
true, then the unique factorization theorem would hold. That is, every 
cyclotomic integer would be, in a unique way (up to units) equal to a product 
of primes. Conversely, the unique factorization theorem would imply the 
above property. 

It took some time before mathematicians were convinced that in general 
unique factorization will not hold for cyclotomic integers. This is the second 
essential difference with ordinary arithmetic. Kummer himself found, some- 


_time later, by an indirect method, the first example of this possibility—the 


field of 23rd roots of 1. 

Now the crucial point in any proof of Fermat’s theorem would have to 
be the knowledge that the factors in (1.2) are pth powers. This conclusion 
being unwarranted, Kummer invented certain “ideal numbers” such that 
the factors in (1.2) would become pth powers of these ideal numbers. 

He had not only to invent these ideal numbers, but also to extend to them 
the language of divisibility and ensure that for these numbers, unique 
factorization holds. The next problem was to measure to what extent unique 
factorization for cyclotomic integers was lacking. 

The success of Kummer’s theory of cyclotomic ideal numbers prompted 
its generalization to other fields of algebraic numbers. This work was done 
by Dedekind, who reinterpreted Kummer’s ideal numbers as being certain 
subsets in Z[{] (or even in OCH which constitute the Dedekind ideals 
(respectively, fractional ideals). Kummer’s ideal numbers were closer to the 
modern-day notion of “divisors”, which embodies all the features for the 
local and global study of number theory. 


2. Basic Facts about the Arithmetic of 
Cyclotomic Fields 


For the sake of fixing notations, I'll recall now some of the basic facts about 
cyclotomic fields, which we will need. All this, and much more may be found, 
with proofs, in any of the standard textbooks on algebraic numbers. I will 
adopt the current modern terminology. 

Let p 2 3 be a prime number. Let ¢ = ¢, be a primitive pth root of 1. 
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K = Q(Q is the cyclotomic field (corresponding to p). Its elements are 
the complex numbers dy + a, ++ a,-2[? "` (with ag,a1, . ,a@,—2 € Q). 

A = Z[¢] is the ring of cyclotomic integers: dy) + a, Lt E ee 
(with do,a, ... ,dy—2 € Z). 

The minimal polynomial of ¢ over Q is the pth cyclotomic polynomial: 


P] 
ge KEXP KEL 


It is an irreducible polynomial with roots {¢,¢7,...,¢?~*}. The trace and 
norm of ¢ are 


TQ = C+ +--+ P71 = -1, (2.1) 
MO) = 1. (2.2) 
Moreover 
(X) = DË W (2.3) 
and 2 
iz [] d—-¢) =p. (2.4) 


i=1 

Every a e A may be written in unique way in the form indicated, so 
{Gesa 2? is an integral basis of A. The elements « e A are roots of monic 
irreducible polynomials with coefficients in Z. The basis {1,¢,... ,¢?~ 2) has 
discriminant si 

d= det(Tr(¢'*")); j=0,1 bens p-2° 

d is also the discriminant of any other integral basis. 

A computation yields 


d = (—1)P (Nipp, (2.5) 
A nonempty subset I of A is an ideal if it satisfies the following properties: 


1. Ifa, Bel, then « + B,a— Bel. 
2. Ifae 1, Be A, then af € I. 


Among the ideals there are I = 0 (consisting only of 0) and J = A. If 
o, Me A the set {$71 Bia;|B1,-.-.B, € A} is an ideal of A. It is said to 
be generated by {a,...,% = If æ € A, then the set Au = {Ba|ße A} is called 
the principal ideal generated by o. 

Ideals of A may be added and multiplied together. If I, J are ideals: 


I+J={a+Bßlael,BeJ)}, (2.6) 


1-1} aplm> 0,916 1 Bre Jp. (2.7) 


i=1 
I +J, IJ are again ideals. It is easy to establish the rules of operations with 
ideals. 
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If I,,I,, I; are ideals and I,J, = I, then I, and I, are said to divide I. 
If P is an ideal, P # 0, P # A, and the only ideals dividing P are P and A, 
then P is called a prime ideal. It may be shown that I divides J if and only if 
I> J (as sets). Thus prime ideals are maximal among the ideals different 
from A (as subsets of A), and conversely. 

If Aw = AB + O then a/ß and p/a are both in A, so ß divides « and o 
divides ß. In the sense already indicated a, ß are associated (a ~ ß) and 
a/f, B/x are units of A. The group of units of A is denoted by U; it is a subgroup 
of the multiplicative group K of nonzero elements of K. 

The fundamental theorem for ideals is the following: 


(2A) Every nonzero ideal of A is, in unique way, equal to the product of 
(not necessarily distinct) prime ideals. 


This theorem is also valid for any algebraic number field, as was shown 
by Dedekind. 

It follows that the rules of divisibility of ordinary integers remain valid 
for the divisibility of ideals. In particular, it is possible to define the greatest 
common divisor gcd(I,J) and the least common multiple lom(I,J) of ideals I, J. 
Since I, divides J, if and only if J, > I, then 


gcd(IJ)=I1+J,  leomUJ)=IaJ. (2.8) 


If x, Be I the congruence « = ß(mod I) means that « — Be I, that is, I 
divides the principal ideal Aa — EL Congruences modulo ideals satisfy pro- 
perties analogous to ordinary congruences of integers. The set of congruence 
classes modulo J forms the residue ring A/T; it is a field if and only if I is a 
prime ideal. 

It is of course important to describe the prime ideals of A. It is easy to see 
that if Q is a prime ideal of A it contains exactly one ordinary prime number 
q. Since q e Q, Q divides the principal ideal Ag, so Q is among its prime ideal 
factors. This leads to determining the prime ideal factors of Aq for any 
prime number q. There are two cases: q = p and q ¥ p. 


(2B) Let 1 = 1 — ¢ € A. Then Aj is a prime ideal containing p, Ap = (AA)? ~} 
and A/AA is the field with p elements. 

Because Ap is the product of p — 1 ideals equal to AA, the prime p is said 
to be ramified in the cyclotomic field K = Q(£). 

For any prime q # p, Kummer also indicated how to proceed. Let F, be 
the field with q elements and let ®, e F,[X] be the polynomial obtained from 
®, by reducing its coefficients modulo q. Let 6,= Foe... Feci (eo > 
1,...,e,_, = 1,r > 0), be the decomposition of ®, as product of DOWER of 
irreducible polynomials of F,[X]. Then: 


(2C) With notation as above, r divides p- l, e= =e =l. If 
f = (p — 1)/r, then each polynomial F; has degree f. The ideal Aq is equal to 
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the product of r distinct prime ideals, say Aq = Qo` Q1°.-.* Q,-1- Each 
A/Q; is the unique field of degree f over F,. The prime ideals Q; correspond to 
the factors F, as follows: if F; e Z[X] and F; reduced modulo q is F;, then Q; 
is generated by the two elements q, FAL, Moreover, f is the order of q modulo 
p, in the multiplicative group F,- 


Thus, each prime q # p is unramified in the cyclotomic field (since 
eg = =e,-,= 1). 

The cyclotomic field K = Q(£) is a Galois extension of Q; its Galois 
group is cyclic of order p — 1. If g is any primitive root modulo p, then the 
automorphism o of K such that o(£) = (? is a generator of this Galois group. 

The conjugates of ¢, i.e., the roots of ,(X) appear in pairs of complex- 
conjugate numbers (none are real numbers). It is convenient to label the 
conjugates of ¢ as follows: ¢ = ¢,¢,..., ¢", where c&* (with z= 
(p — 1)/2) is the complex-conjugate of ¢®, for i= 1, 2,..., 7. 

The Galois group transforms any ideal J of A into other ideals, which are 
the conjugates of I. If I is a prime ideal so are its conjugates. Actually, it is 
possible that an ideal is equal to certain ofits conjugates. In the decomposition 
of any prime q into prime ideals as indicated in OC) the prime ideals 
Qo, - - -, Q,- are conjugate to each other. 

The product of an ideal J and all its conjugates gives a new ideal NU), 
called the norm of I. This is necessarily the principal ideal generated by an 
ordinary integer; in symbols N(/) = An. The number of elements of the 
residue ring A/I is the absolute value |n]. 

To measure the existence of nonprincipal ideals in A, it is convenient to 
introduce the fractional ideals. These are subsets I of K such that there exists 
ae K, a #0 for which a! = {aß|ß e I} is an ideal of A. Thus, every ideal of 
Ais a fractional ideal; K itself is not a fractional ideal. The set of nonzero 
fractional ideals of K is a multiplicative abelian group under the operation: 


IJ = d abilo: € I, nei, 
i=1 


This group is denoted by ./d(K). Among the nonzero fractional ideals ofK 
are the principal fractional ideals, namely those of the form Aw = {Balb E A}, 
where a + 0. They constitute a subgroup of d(K), denoted by P #d(K). 

Two nonzero fractional ideals I, J are said to be equivalent when there 
exists a nonzero principal fractional ideal Aa such that I = (Aa)J. This is 
indeed an equivalence relation; the nonzero principal fractional ideals are 
those equivalent to the unit ideal. Each equivalence class of ideals is called 
simply an ideal class. The class of the ideal J shall be denoted [J]. The set 
of ideal classes forms a multiplicative abelian group, namely the quotient 
group Jd(K)/P Jd(K), which is called the ideal class group (or, simple, the 
class group) of K and is denoted by @¢(K). 


(2D) The group of ideal classes of K is finite. 
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For a proof of this theorem, which is true for any algebraic number field, 
see [Ri], page 123. 

The number h = h(p) of ideal classes of K = Q(¢,) is called the class 
number of K. Thus h = 1 if and only if every ideal of K is principal. This is 
equivalent to saying that every algebraic integer is equal, up to a unit, to the 
product of powers of prime algebraic integers (those whose only factors are 
units or associated numbers). 

The structure of the abelian group €/(K) is easily described. For every 
prime q, let €/,(K) be the qg-primary component of the ideal class group 
€¢(K). It consists of all ideal classes having order a power of q; in other words 
€l AK) is the q-Sylow subgroup of €/(K). By the structure theorem for 
finite abelian groups, €/(K) is isomorphic to the direct product of the groups 
€l K). It should be noted that @/,(K) is nontrivial if and only if q divides 
the class number h of K. Moreover, each @/,(K) is isomorphic to the direct 
product of cyclic groups having order a power of q. If q‘ is the exponent of 
the group €/,(K), that is, the maximum of the orders of the elements of 
€l (K), then the order of every element of @/,(K) divides 4 (this is a well- 
known property of finite abelian groups). 

The following easy result is true in general for finite abelian groups: 


(2E) If h = gu, where q is a prime not dividing u, e > 1, and if q is the 
exponent of 6¢,(K), then t < e. 


i Some indications about how to compute the class number h will be given 
ater. 

Turning to the group of units of the cyclotomic field, let W be the group 
of roots of unity belonging to the cyclotomic field. Then 


W = {156, SR Pt E ... u 


The structure of the group of units is as follows: 


(2F) U = W x U’, where U’ is the free abelian multiplicative group with 
r = (p — 3)/2 generators. 


That is, there exist units £4, . . . , ¢, in K such that d # 1 (for every j # 0) 
and such that every unit e e U may be written in unique way in the form 


P wey? .. Ze: 


with w e W,e;e Z(ii=1,...,r). This isa particular case of Dirichlet’s general 
theorem (1846) on the structure of the group of units of any algebraic number 
field (see [Ri], page 148). 

{€1,...,€,} is a fundamental system of units of K. Any two fundamental 
system of units have the same number of elements. Expressing the units of 
one fundamental system in terms of those of another, considering conjugates 
their absolute value and logarithms, it may be seen that the following real 
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number is indeperident of the system of units: R )|, where 


logle{| - - - logle?| 
L= : : ; 
log|e?| - - - log|et”| 
R is called the regulator of the cyclotomic field K. l 
The explicit determination of a fundamental system of units turns out to 


be, in general, extremely difficult. I will indicate later its connection with the 
class number. 

Among the units, I have already mentioned the roots of unity (they are 1, 
—1,¢, —¢,...,C?73, EC HI and the elements (1 — ¢/)/(1 — ©), where 1 < 
j#k<p-I1 

Certain units are real numbers, for example the Kummer units 
(Kreiseinheiten): 


ô, = a = a = = i (positive square root) 
for k = 2, . . . , (p — 1)/2. (It is necessary to prove that 
1-& 1-¢" 
Teg Ee 


is a square in K; this follows from (2G) below). 

Let U* be the totality of real units of K. It is a subgroup of the group U 
ofall units of K. Let V denote the subgroup of U*, generated by the (p — 3)/2 
Kummer units. It is important to compare these three groups V c U* c U. 
First, Kummer proved: 


(2G) Every unit of K is the product of a root of unity and a real positive 
unit. 
The Kummer units are multiplicatively independent: if 
(p~ 1)/2 


u Ak = 1, then e,='' = ep -13/2 = 9. 


To say that they form a fundamental system of units amounts saying that 
V = U*. Thus the index of V in U* measures how far V is from U* and it 
will be connected with the class number. 

Later I will mention the deepest of Kummer’s results about units, which 
is valid under a special hypothesis about the class number. 


3. Kummer’s Main Theorem 


In 1847, Kummer announced his main theorem in a letter to Dirichlet, who 
then communicated it to the Academy of Sciences of Berlin. 

In this proof, published in 1850, Kummer claims that Fermat’s equation, 
for “regular” prime exponents, has only the trivial solution in the cyclotomic 
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field Q(¢). Yet, his proof contains an “unaccountable lapse” (in Weil’s 
expression; see Kummer’s Collected Papers, vol. I, Notes by A. Weil, page 
955). This concerns the unjustified assumption that a solution a, ß, y in the 
cyclotomic field may be chosen such that «, ß, y have no nontrivial common 
divisor. However, Kummer’s argument did establish without any gaps, that 
there are no (nontrivial) solutions in ordinary integers when the exponent 
is a regular prime. Later, in 1897, Hilbert succeeded in adapting Kummer’s 
proof, so as to exclude solutions from the cyclotomic field. 

At the beginning, Kummer proved the theorem under two hypothesis 
about the exponent p: 


Hypothesis 1. The pth power of a nonprincipal ideal is never a principal ideal. 
Hypothesis 2. If « is a cyclotomic unit and there exists an ordinary integer 
m such that «= m (mod Ap), then there exists a unit ß such that a = ß?. 


In his second communication to the Academy, Kummer writes: 


My proof of Fermat’s theorem, which Mr. Lejeune Dirichlet has com- 
municated to the Royal Academy of Sciences, is based on two hypotheses. At 
that time, I could not in general decide which primes satisfied these hypothesis. 
For this purpose I was lacking the expression for the number of nonequivalent 
classes of complex ideal numbers, about which, already for a long time, 
Mr. Dirichlet has promised an article. After waiting for its publication, I have 
undertaken, with the help of some oral suggestions from Dirichlet, to derive 
the required expressions, and I have succeeded not only to discover them, 
but also to base, on these expressions, both hypotheses in my proof of Fermat’s 
theorem. 


Later, I will indicate how Kummer succeeded in characterizing the ex- 
ponents p for which these hypotheses hold. As a matter of fact, he actually 
proved that if the first hypothesis is satisfied then the second one follows 
automatically. 

A role is played in the proof by the semi-primary integers. An element 
a € A is Said to be semi-primary if x £ O (mod AA) but there exists an ordinary 
integer m such that a = m (mod AA). The following lemma will be useful 
(it is easy to prove): 


Lemma 3.1. 
1. If «=m + nå (mod AZ”), with m, n ordinary integers, m # 0 (mod p) and 
if lis an integer such that Im = n (mod p), then C'a is semi-primary. 


2. If a, Be A are semi-primary, there exists an integer m such that «= mp 
(mod 44°). 


In the next lemma, I indicate what can be proved without making any 
hypothesis concerning the exponent. 


Lemma 3.2. Let p > 2 be a prime. Assume that a, B, y € A and let 
I = ged(A(w + ¢*B)|k = 0,1,...,p — 1). 
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1. If ®+ß?+y’=0 and if AX y, then for every k = 0,1,..., p— 1 there 
exists an ideal J, of A such that 


Ala + CB) = JE. (3.1) 


The ideals Jo, Jı,- .., Jp-1 are pairwise relatively prime and not mul- 
tiples of AA. 

2. If a? + BP = eöPi”?, where e is a unit of A, Ae A and A does not divide 
a, P, ô, then m > 2. 

3. If oP + BP + y? = Oand if y = 61", with m > 1, ô e A, À not dividing a, P, ô, 
then there exists jọ, O < jo < p — 1, such that for every k = 0, 1,...,p— 1 
there exists an ideal J, of A, such that 


Ala + CB) = (AAPM TP, 
Aa + PB) = (AATJE (when k # jo), 


where I’ = gcd(Aa,Aß). The ideals Jo, J4, . . . Jp—1 are pairwise relatively 
prime and not multiples of AA. 


(3.2) 


PROOF. 
(1) To begin 


—yP=aP + p = D (o + CB). (3.3) 


First I note that if1 < j < k < p — 1, then ged(A(« + EB), Aa + C*B)) = I. 
This is quite easy. Indeed, if P is any prime ideal, e > 1, and P* divides both 
ideals A(x + C/B), A(x + C*B), then 

a+ CB e P 
a+ CB e P. 


Then, taking the difference yields 
(CF — HB = OU — Be Pe. 


But 1 —¢* J~1—€=4,s0 Ape Pr and PAAR 

Similarly (Ch — (a OTT — 1)a e P* and P*| AAa. But Pf AA, other- 
wise P| Aå, so P = AA (both being prime ideals) and from (3.3) AA | Ay against 
the hypothesis. It follows that Pr divides Aw, Aß and therefore P*| A(a + ¢'B) 


for every i=0,1,...,p — 1. This is enough to prove the assertion. 
Let 
A k 
y= ee fork=0,1,..., p-1, (3.4) 
so Jo, Jj,..-,J-1 are pairwise relatively prime ideals, not multiples of 
Aa and 


Aaf PZI 
ZI) = TK 
I k=0 
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Then each J, is the pth power of an ideal Ji, = J?, in virtue of the unique 
factorization theorem for ideals. 

(2) Multiplying «, B by any pth roots of 1 still gives algebraic integers 
satisfying the same equation af + BP = eö?A”?. Thus, by Lemma 3.1, it may 
be assumed that «, D are semi-primary integers. 

Assume that m = 1. Since a, ß are semi-primary, there exist a, be Z such 


that 
« = a (mod AAT and $= b (mod 42°). 
Since Ap = AAPT!, 
a? = a (mod Aj?**) and PP = b? (mod Ad?*?), 


Since m = 1, a? + bP =a? + BP + pd? ** = eAP(ôP + & 1d), where pe A, AYO. 
If p° is the exact power of p dividing a? + b” (with e > 0), then the exact 
power of AA dividing A(a? + b”) is a multiple of p — 1, because Ap = AAT. 
Since 4 FA AA ¥ A(6? te pd), so the exact power of A) dividing the ideal 
AleAP(6P Ce Toi is AA?—a contradiction. This proves that m > 2. 
(3) From 


sët = aP + BP = TI (œ + OI (3.5) 
k=0 


it follows that there exists j, 0 < j < p — 1 such that AA| A(a + CB). 
But then 44| A(« + DÉI for every index k. Indeed, if k + j, then a + ¢*8 = 
(æ + EB) + AT DB and DI 1 ~ A. So 


a+ß a+ CR ESA 
H H A + er* a H 


These elements are pairwise incongruent modulo AA. Otherwise, 
a + OB =a + CB (mod AA?) 


with j # k. So A?|(¢/ — CB ~ AB and A|B, against the hypothesis. 

Since the number of congruence classes modulo A is p, there exists an 
unique index jo, such that (x + ¢/°B)/A? = 0 (mod AA), that is, 4° |a + CB and 
A? Y a + CB for k # jo. So from (3.5) 

(AAP TPD AAT DT divides A(x + af) 
and p(m — 1) + 1 > 1 since mz 2 (by part (2)). 
If I’ = ged(Aa,Aß), then I'| A(x + (ët for every k and AA Y I’. Hence 
Ais + CB) = (APT DHII, 
Ala + CB) = (AAW, (for k # jo), 
where Jo, Ji,...,4,-1 are ideals, not multiples of AA. 

These ideals are pairwise relatively prime: if P is a prime ideal dividing 

Ji, J (k +j), then (AAP divides Ais + CP), Ala + C*B), hence also 


AB — E*B) = (AA)AB, so I’P|AB. Similarly Pl Aa, hence I'P divides 
I’ = gcd(Ax,Aß), which is impossible. 


A. 





(3.6) 
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Multiplying the relations (3.6) yields 


(Aö)PF(AA)”? = Tl Ala + CB) = (AAT Jo Jp- 


SO 
(Ad)? = TI Sy. 


By the unique factorization of ideals, for each k there exists an ideal J, 
such that J, = JẸ and clearly AA FI, moreover these ideals are pairwise 
relatively prime. 

In view of (3.6) this concludes the proof. oO 


A prime p is said to be regular if p does not divide the class number 
h = h(p) of the cyclotomic field K. 


Lemma 3.3. The prime p is regular if and only if the pth power of a non- 
principal ideal of A is never a principal ideal. 


Proof. By definition p is regular when p does not divide the order h of the 
class group @/(K). This means that the p-Sylow subgroup of @¢(K) is trivial. 
In other words, there is no element [J ] in @¢(K) having order p: if [J]? = [A], 
then [J] = [A]. Or equivalently, if J” is a principal ideal, then J is already 
a principal ideal. o 


I have indicated already that Kummer succeeded in proving that first 
hypothesis implies the second. This is the contents of his difficult and impor- 
tant lemma on units: 


Lemma 3.4. If p is a regular prime, if « is a unit of A such that there exists 
m e Z satisfying the congruence « = m (mod Ap), then « = B?, where D is a unit 
of A. 


In order to prove this lemma, Kummer introduced 4-adic methods. I shall 
give a full proof of this lemma in my forthcoming other book on Fermat’s 
theorem. 

And now, finally, here is Kummer’s main theorem: 


(3A) If p is a regular prime, there exist no (nonzero) cyclotomic integers 
a, B, y such that af + BP + y? =0. 


Proor. By Gauss’s theorem for the exponent 3, the assertion is true for 
p = 3.1 shall assume the theorem false (hence p > 3) and will consider the 
traditional two cases. 


First Case. a, ß, y are not multiples of 4 = 1 — ¢. 
I may assume p > 7. Indeed, if p = 5, then a, p, y are congruent modulo 
AÀ to the integers +1, or +2, because A/A/ = F; (the field with 5 elements). 
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Then «5, £5, y$ are congruent to +1, or +32, modulo 445 (note that 
A5 = AA*). In any case, «ê + ß° + y5 #0 (mod AA°), so the theorem would 
be true for the exponent 5. 

Thus, let p > 7. 

Multiplying o, ß by any root of 1 still gives algebraic integers satisfying 
the same relation. So, by Lemma 3.1, it is possible to assume, without loss 
of generality, that a, ß are semi-primary integers in K. 

By Lemma 3.2 

Alx + C*B) = Kl (k =0,1,...,p— 1), 
where the ideals Jo, J,,...,J,-1 are pairwise relatively prime and not 
multiples of AA, and I = ged(A(a + B), A(z + CB),..., Ala + (PB). 

Since « + CET #0 (mod AJ), because AA does not divide Ay, there 
exists a root of unity C such that Cla + ¢?~'B) is a semi-primary integer. Let 

q B 
CSS Te p= a Beim: 
C(x + PP) a + PP) 
Hence a + {1 ="! and Ala + CB) = (J; /Jp-1)? fork =0,1,...,p— 2. 

The fractional ideals (J,/J,_,)” are principal, and p is a regular prime. 
By Lemma 3.3, J,/J,-ı is also a principal ideal, say Jl, = Alur/n) 
(for k=0,1,...,p — 2), where u, EA, ny € Z and Hp, n, have no common 
factor (not a unit) in A. We note also that AA does not divide Au,, Any. 

Taking into account (2G), then 


al + Cpt = oy E (<k<p-2) 3.7) 
k 
where ¢, isa real unit of K,0 < c, < p — 1.Since A/A/ = Fp, Hx = m,(mod AA), 
where m, € Z. Hence uf = m? (mod AAP) because Ap = Ad?~'. Therefore 
n,(a’ + CB’) = em? (mod AA?) 
(for k = 0,1,...,p — 2). Considering the complex-conjugates, we have 
na + CEB) = e ` mp (mod AJ?) 
(because 4 = 1-7" is associate to À). Hence eymf sti "met CB) = 
Cena’ + C~*B’) (mod AZ”) and since AA does not divide An, 
ai + DÉI = C + £7 *B") (mod AP). (3.8) 
We now evaluate the exponents o Since a, ß, ¢(a + {P”!ß) are semi- 
primary integers, it follows from Lemma 3.1 that there exist rational integers 
a, b such that 
a = al’(a + {P~ +p) (mod Ad”), 
B = bio + {P~ 1B) (mod 44°). 
Since AA A(x + {P"!B), 


a’ + CB’ = a + b (mod 422). 
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Similarly 7 
a’ + CtP =a + CC H (mod 44°). 


But & = (1 — AN = 1 — jå (mod AA?) for every je Z. Hence 


a+b—kbé= (1 — 2c,4)(a + b + kbA) (mod AA?) 
so 
2c,(a + b)A = 2kbi (mod AA?) 
hence 
cla + b) = kb (mod 44). 
Since a, b, c,, k € Z, 
c,{a + b) = kb (mod p). 


From a + ("18 = (a + C?>*b)C'(a + (Pt) (mod AA?) it follows that 
1 = (a + ECH (mod AZ). 
But ¢ = 1 (mod AA), hence (?"! = (’= 1 (mod AA), so 
a+b=1(mod AA), 


hence a + b = 1 (mod p), so c, = kb (mod p) for k = 0, 1,..., p — 2. 
In particular 


Co =O(modp), c,=b(modp), c,=2b(modp), c, = 3b (mod p). 
Thus we may rewrite (3.8) as follows (for k = 0,1,2,3): 

a’ + BB = pol, 

x + Cp’ SS Che = g-ig = pA’, 














= E 3.9 
a + Cp Dë ef = pol? ES 
a’ + CoB en Wed? BW E = p34’, 
where po, P1» P2» P3 € A. 
Consider the matrix of coefficients 
11-1 -1 
1 14 = t+ = Gre 1 
M = 1 ge {t ee 
1 E: Sa SS 
By Cramer’s rule 
pM) p dom y M) g _ e: 
~ det(M)’ ~ det(M)’ ~ det(M)’ ~ det(M)’ 


where M, is the matrix obtained from M by replacing the ith column by 
the right-hand side of the relations (3.9). Thus det(M,) € AAP and since AA 
does not divide Aa’, Ap’, Ax’, Af’, then necessarily Ad? divides det(M), that is 


1-90 — da — DN ID — EYE? — £7?) = 0 (mod AP). 
(3.10) 


3. Kummer’s Main Theorem 89 


We discuss several possibilities. 


1. b = 0 (mod p): then ß = 0 (mod AA), against the hypothesis. 

2. b= 1 (mod p): then B =a + ß (mod AA), hence « = 0 (mod AA), against 
the hypothesis. 

3. n#0, 1 (mod p) and 2b # 1 (mod p): then all the factors in (3.10) are 
associated with A, thus 4? divides 46, so p < 6, against the hypothesis. 

4. b #0, 1 (mod p) and 2b = 1 (mod p): then 28 = a + B (mod AA) soa = f 
(mod AA). 


In view of the symmetry of the relation of + BP + y? = 0, we must also 
have « = y (mod AA). But af = a, B? = B, af = y (mod AA), because A/AA = 
Fp. It follows that « + ß + y = 3a = 0 (mod Ad) so « = 0 (mod AA) contra- 
dicting the hypothesis. So, we have again reached a contradiction, concluding 
the proof for the first case. 


Second Case. À divides «, p, or y. 
We assume without loss of generality that / divides y. 
We may write y = 64”, where ô e A, A does not divide ô, and 


of + BP = — OPAMP, 
So there is a relation of the form 
of + BP = eða", (3.11) 


where ¢ is a unit, 2 does not divide 6 and m is minimal. By Lemma 3.2, m > 2. 
It follows that 4 does not divide « otherwise A also divides ß and from (3.11). 
after dividing by AP, we would obtain a similar relation with exponent 
(m — 1)p for A, against the minimality of m. Similarly A does not divide £. 

Our purpose will be to derive a relation analogous to (3.11) with smaller 
exponent of A. By Lemma 3.2, after changing ß into Op (without loss of 


generality), we may write 
Ala + B) = (AAP) TIT JB, 3.12) 
Ala + CHB) = (AAI (fori <k < p— 1), 


where I’ = gcd(Aa,Af), and the ideals Jo, J,,..., Jp—1 are pairwise relatively 
prime and not multiples of A2. Then 


a + CR IN -p(m-1 Jy i 
a(R -an ( I I<k<p-l. (13) 





This shows that the fractional ideals (J,/J 9)? are principal and since p 
is regular, J,/Jo is also principal. Thus there exist elements u, E€ A, n, €Z 
such that J,/Jo = dipl, Since AA does not divide J, (0 < k < p — 1), we 
may assume that 4 does not divide the elements up, n,. So there exist units 
€x of A such that 


(o + Epa D = ela + air (1 < k SP - 1). 
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In particular, if k = 1, 2, then 


(a + BAD = oa + P) GT 


ny 


P 
(a + Dën = eala + P) E | 
2 
Multiplying the first relation by 1 + ¢ and then subtracting the second 
relation gives 


(a + pyre» SE SIE +L)- Ye] 
ni n, 


(u2n1)”E2 = 4 ‘jplm— 2) 
alto a(t tr 


But 1 + Cis a unit of A and the above relation has the form: 
(a)? + EP = 6 (SPAM VP, (3.14) 


where a = pna, ß = Hani, di = nin, A Y Ò’, and e, e” are units. This is not 
yet like relation (3.11), but we shall transform it into a relation of that type. 

Since p(m — 1) > p, AP divides («’)? + e'(f')’. But AP’ = Anan; is relatively 
prime to AA, i.e., AB’ + AA = A, so there exists an element x € A such that 
KB’ = 1 (mod AA). Then vëft = 1 (mod A4”) and (eo + € = 0 (mod AAP). 
So there exists pe A such that eg = of (mod AAP). But A/AA = F, so there 
exists re Z such that p =r (mod AA). Then e& = p’ =r? (mod AAP). By 
Kummer’s Lemma 3.4 on units, e’ is the pth power of a unit £4 in Arel = (e})? 
and we may rewrite (3.14) as follows: 


Hence 


(nın). 





(un)? — 


(op + (Ep) = e” (0) Pa De, (3.15) 
This is now a relation like (3.11), with m — 1 instead of m. This contradicts 
the choice of a minimal m, and the proof is concluded. oO 


In particular, if p is a regular prime, there exist no nonzero integers 
x, y, ze Z such that x? + y? + 2 =0. 

In order to understand the force of Kummer’s theorem, it will be necessary 
to find out which primes are regular. Is there any neat characterization of 
regularity? This, and other questions, will be considered in the next lecture. 
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LECTURE VI 


Regular Primes 


To utilize the force of Kummer’s main theorem, it is imperative to deter- 
mine when a prime p is regular. In theory, at least, this is simple. All one needs 
to do is to compute the class number of the cyclotomic field K, = Q(¢,), 
where ¢, is a primitive pth root of 1. And so Kummer began by discovering 
formulas for the class number. However, it soon became apparent that the 
computations involved were much too difficult, and other more suitable 
criteria for regularity were needed. These were also discovered by Kummer. 

I begin this lecture by giving all these classical results of Kummer and 
then I will describe more recent developments in this area. 


1. The Class Number of Cyclotomic Fields 


Kummer established formulas for the class number of cyclotomic fields in 
a series of very important papers (1850, 1851). A proof of these formulas 
may be found, for example, in the book by Borevich and Shafarevich, or in 
Hilbert’s Zahlbericht. 

One formula involving the class number and other invariants of the 
cyclotomic field may be attributed to Dirichlet. 

I recall the notation. Let p be an odd prime, let £ = ¢, be a primitive pth 
root of 1, let K = K, = Q({) and A = A, = ZIL the ring of integers of the 
cyclotomic field K. 

For every real number t > 1, let o(t) be the number of ideals I of A having 
norm N(N) < t. Then 


a(t) MP 3/2_(-1/2R 
= p” 





lim 


t>o t 


h #0, (1.1) 


94 VI Regular Primes 


where h = h(p) is the class number of K and R is its regulator (defined in the 
preceding lecture). 

There is an important connection between the class number and the 
Dedekind zeta-function of the field K. 

It is well-known that the Riemann zeta-series ye ı 1/n" defined for s real, 
s > 1, converges uniformly and absolutely, for every 6 > 0, in the interval 
[1 + ô, 00). It defines therefore a continuous function for 1 < s < œ: 


C(s) = Ir (1.2) 


(Lal is the Riemann zeta-function. ¢(s) admits the following representation 
as an infinite convergent product (Euler): 





1 
=[| SR (for s > 1), (1.3) 
1-7 
P 
(where the product extends over the set of all primes p). 
The difference 
(ai — (1.4) 
remains bounded when s tends to 1 (s > 1). In particular, 
lim (s — Diels 1. (1.5) 
s>1+0 


With every algebraic number field F, it is possible to associate its Dedekind 
zeta-function (when F = Q this gives the Riemann zeta-function). To do this, 
for every integer n > 1 let y(n) be the number of ideals J of the ring of algebraic 
integers of F such that N(I) = n. For every real number s > 1 the series 


=, vin) 
GER , e (1.6) 





(where the sum is over the set of all ideals J of the ring of integers of F) con- 
verges uniformly and absolutely, for every ô > 0, in the interval [1 + ô, œ). 
Therefore it defines a continuous function of s, 1 < s < œ, called the Dedekind 
zeta-function of F: 





1 
= . 1.7 
Cr(s) N)‘ (1.7) 
This zeta-function also admits a Euler product expansion: 
fr(s) = [| —— (for s > 1) (1.8) 


P 
l- NPY 


(where this product extends over the set of all prime ideals 9). 
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It may be shown that if K = Q(0), then 


. 2p 3V2 P-1V2R 
lim (s — Ds) = Ee h£0. (1.9) 


s>1+0 





Formulas (1.1) and (1.9) are clearly not suitable for the computation of 
the class number. They have to be transformed into something more 
appropriate. 

Let m > 1 be an integer. A mapping y:Z > C (complex numbers) is a 
modular character, belonging to the modulus m, when the following condi- 
tions are satisfied: l 


1. y(a) = 0 if and only if ged(a,m) # 1. 
2. Ifa = b (mod m), then x(a) = x(b). 
3. z(ab) = x(a)x(b). 


It follows that |y(a)| = 1 for every integer a, ged(a,m) = 1. 

Among the modular characters with modulus m there is the principal 
character modulo m, denoted yo, namely: yo(a)=1 when ged(a,m) = 1, 
Xola) = 0 when gcd(a,m) # 1. 

If m divides m’ and y is a character with modulus m, it induces a character 
x with modulus m’, which is defined as follows: if gced(a,m’) # 1, then y’(a) = 
if gcd(a,m') = 1, then zial = y(a). 

Let y be a character with modulus m, If there exists a divisor d of m and a 
character y’ with modulus d which induces y, then y is said to be imprimitive; 
otherwise y is a primitive character. If y is a primitive character with modulus 
m, then its modulus is called the conductor of x, denoted by f,. In particular, 
the principal character of modulus 1 is primitive and has conductor equal to 1. 

If x is a character with modulus m, the series 3 7.1 Sing is called the 
L-series of x. For every 6 > 0 and every character y with modulus m, the 
L-series of y converges uniformly and absolutely in the interval [1 + 6, 00). 
Hence it defines a continuous function L(s|z) of s on the interval (1,00): 








Ula- $ 27 (for 1 < s). (1.10) 
n=1 
The function L(s|x) admits the product representation 
1 
L(s| y) = for 1 < s}. (1.11) 
lx I — ( 
p° 
For the principal character ko, with modulus m: 
1 
L(s|xo) = [] (1 — 5) Ks) (for 1< s). (1.12) 
plm 


Actually, if X Æ Xo, the domain of convergence of the L-series of y is 
larger: for every ô > 0 it converges uniformly on the interval [6,00). Hence 
it defines a continuous function on (0,00), still denoted L(s|x). 
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If x is a character with modulus m, if ë = Ce is a primitive mth root of 1, 
say č = cos(2z/m) + isin(2z/m), the sums 


T(x) = 5 y(ae*  (k=0,1,...,m— 1) (1.13) 


ged(a,m)= 1 
a=1 


are called the Gauss sums belonging to the character y (and to £). In particular 


5 (m) when 7 = Zo, 
tolx) = a -f 1.14 
i BECH TI" In ` when z # xo. 014 


The principal Gauss sum is 
mMma=nw= A. ak. (1.15) 


If x is a primitive character with conductor f, then Golf = f- 
Now if K = OCL then its Dedekind zeta-function is expressible as a 
product involving the L-series of the characters modulo p: 


Cals) = TI Lelo — (for s > 1) (1.16) 
and therefore 
Cx(s)= [] Lies) &(s) (for s > 1). (1.17) 


From this may be deduced the following expression for the class number 


h(p) of Q(¢,): 
pr? 


h(p) = 70-320- DIR x D Ld Ei (1.18) 


In order to compute h = h(p) it is still necessary to evaluate Lg) for 
X # re It may be shown that 








1’--I1__f 2 con 
Lis|y=- 3 zo >= | (for s > 1). (1.19) 
P k=0 n=ı M 
This gives 
p°’? 1 p-1 1 
h(p) = >@-I27@- DER x pr 2 JI IE 1 (x) log =}. (1.20) 


This last formula has an advantage over the previous one. It does not 
involve any infinite product or series. However it is still not suitable for 
direct computation, since the integer h is expressed in terms of logarithms 
and complex numbers. To put it in appropriate form it is necessary to evaluate 
L(1|z) when x is an even character, that is, x(a) = x(— a), and when x is an 
odd character, that is, x(a)= — x(— a). 
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If x is even, then 





L(l|x) = DIE ëieel a =) (1.21) 
If x is odd, then 
, Ai 
Lila: zn 2. Wik. (1.22) 


This gives the important formulas: 


(1A) The class number of K = Q(£) is h(p) = h,(p)h.(p) where 





1 _ 
hy(p) = Qpe 27 GG") Gm? *)| (1.23) 
and 
pP 312 (p—3)/2 |(p~3)/2 ; i 
ban = x II | È n° logit i], (1.24) 
R k=1 | Zen 


where n is a primitive (p — 1)th root of 1, g is a primitive root modulo p, 
g; = g? (mod p) with 1 < g; < p — 1 and 
p-2 
G(X) = }, gt. 
j=v 
The numbers h,(p), h2(p) are called respectively the first factor and the 
second factor of the class number of K = Q(Q). 
The formula for the first factor may be also written in the following ways. 
First: 


bunt = y(p)- IT Lol, (1.25) 
xeS 
where S is the set of odd characters modulo p and 
p We pit 34 
v(p) = 2p Lë = a 32 DZ (1.26) 
Secondly: 

1 pol 
big = —G-an IL 3. MEI, (1.27) 

(2p) eS k=1 





where S was defined above. 

At this stage of the theory nothing yet may be said about the nature of the 
numbers h,(p), h2(p), but it will soon be seen that they are integers and in fact, 
they admit an arithmetical interpretation. For this purpose it is necessary 
to study the field K* = Q(¢ + ¢7 +) which is equal to K ^ R. K* is called 
the real cyclotomic field. 

The degree of the extension K* |Q is (p — 1)/2. The ring of integers A* of 
K* is equal to A* = Z[C + CT The regulator of K* is Rt = Bis" 302, 
where R is the regulator of K. 
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Comparing the Dedekind zeta-series for K and K* leads eventually to 
the following result: 


(1B) The second factor h,(p) of the class number of K is equal to the class 
number of K* = Q(¢ + E IL In particular, h,(p) is a positive integer. 


For this reason, the second factor is also called the real class number of 
OU) and denoted by ht = h*(p). From now on, I shall use this notation, 
since it is more suggestive. 

Another interpretation of h*(p) is the following: 


(1C) h*(p) is the index of the subgroup V in the group U*. 

Recall that U* denotes the group of positive real units of K and V is the 
subgroup generated by the Kummer circular units 
E= De i = Gn 
1-¢ 1-67! 
(it suffices to take k = 2,...,(p — 1)/2). 


If g is a primitive root modulo p, let 6 = ö,. If o is the generator of the 
Galois group G of Q(¢)|@ such that o(¢) = C9, let 





(1.28) 


li 








On 


A= |det(log o'* (5); a ets vm 59/2 (1.29) 
Then the second factor may be written as 
(p- 3)/2 
h*(p) = a (1.30) 


The Kummer units are independent, since the index (U* : V) is finite. They 
constitute a fundamental system of units if and only if h*(p) = 4. 

The number h(p)/h*(p) = h,(p) is also called the relative class number of 
Q(¢) and denoted by h* = h*(p). From now on I shall use this notation. 

Kummer also showed: 


(1D) h* is a positive integer. 


In principle, formula (1.23) allows us to compute the first factor of the 
class number—at least when p is not too large. As I shall soon indicate, 
Kummer made extensive computations. On the other hand, formulas (1.24) 
and (1.30) are still unwieldy, the point being that the regulator is extremely 
difficult to compute. Indeed, it would require the knowledge ofa fundamental 
system of units for K. This is the crucial difficulty. 

Recall however that all that is needed, is to know whether the prime p 
divides the class number. 

In 1850 Kummer succeeded in proving the important: 


(1E) p divides the class number h if and only if p divides the first factor h*. 
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To establish this theorem, Kummer introduced transcendental methods. 
More precisely, for the first time he made use of p-adic methods (without 
calling them by this name). Such methods appeared explicitly only with 
Hensel (1908). So, once more, Kummer was a pioneer of genius. 


2. Bernoulli Numbers and Kummer’s Regularity 
Criterion 


In 1850, Kummer proved the following beautiful condition for regularity: 


(2A) p divides h* if and only if there exists an integer k, 1 < k < (p — 3)/2, 
such that p° divides the sum RZ] CH 


Proor. I sketch the main points in the proof. By (1.23), h* = |u|/(2p)'"' 
where t = (p — 1)/2, u = G(n)G(n°)  - - G(y?~?). 

From |u| = h*(2p)'~', it follows that u is independent of the choice of g 
(a primitive root modulo p) and of ņ (a (p — 1)th root of 1). It is convenient 
to choose g such that op Is 1 (mod p?), which is always possible. 

Let B = Z[n] be the ring of integers of the field Q(y). To show that p 
divides h* is equivalent to proving that if P is any prime ideal of B dividing 
Bp, then P divides B(u/p'"'). 

Let P be such a prime ideal. By changing n appropriately into some other 
(p — 1)th root of 1, it is possible to insure that P|B(1 — gn~'). This may be 
seen by considering the decomposition of Bp into a product of prime ideals. 

Since u = G(n)G(m?) -- + Gë 7), P divides B(u/p'"') if an only if there 
exists L odd, 1 < | < p — 2, such that P divides B(G(n')/p), that is, P? divides 
BG(n'). It may be seen that P does not divide BG(y?~) (by the choice of n). 
Since 


p-2 
1—g?~* = [| (1 — gn’) = 0 (mod p°), 
j=0 
P? divides 1 — gy?~?, so g = n (mod P?). Therefore 
p-2 ; 
Gill = > gg” (mod P?). 
j=0 


Since p is unramified in B, P? divides BG(n') exactly when p? divides 
AE ê gg”. Putting g; = g! + a,p (mod p°), with integers aj, yields 
gi" = (1 + 1g" — Ig* (mod p?) 
(for j = 0,1,....,.p — 2). Hence 


p-2 p-2 ` 
L git =(l+ (ps a") (mod p°). 
j= j= 
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EE + 1, p? divides 3 E o gjg” if and only if p* divides 3 EZ git! = 
poi j'*). Putting 2k = l + 1, this gives the stated condition. 


The sums 33. j* have been studied by Fermat, Pascal and Jakob 
Bernoulli. 


(2B) For every integer k > 0 there exists a polynomial S,(X) € Q[X] with 
the following properties: 


1. S,(X) has degree k + 1 and leading coeffiicent 1/(k + 1). 
2. (k+ 1)!8,(X) e Z[X] 
3. for every n> 1, S,(n) = } 5-1 F. 


These polynomials are determined recursively as follows: 


So =X 
and if k > 1, then 


k+1 k 
e T )sun+ d 2 saree eum Su 


k 
= (X + III —L (21) 
Moreover, S,(X) has no constant term. 


Proor. This is an easy induction on k. oO 


For example: 
So(X) = X, 
S,(X) = 4X? + 4X, 
SX) = 4X? + 3X? + 8X, 
S3(X) = 4X* + 3X? + 4X7. 


In view of (2B), each polynomial S,(X) may be written in the form 


EE AT WR EE (air? 
k+1 
seep ( S ba (2.2) 


where the coefficients b,; are rational numbers. 

Euler discovered the noteworthy fact that bj, = b,, for all j > k. This 
remarkably simplifies the computation of the successive polynomials S,(X). 
But, even more is true, the coefficients b,, satisfy a recurrence relation and 
are also intimately connected with a power-series development. 

To be more explicit, the formal power series 

eX —1 1 


1 1 
=| 2 2 A 
KEE (2.3) 
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is invertible, since its constant term is 1. Let the inverse be written as follows: 


xa Bot ix + 





24 SA 
ER X e 8 ER ; (2.4) 


where B; are rational numbers (i > 0). For example, a simple computation 


gives Bo = 1, B, = —4, B, =3. 


(2C) The numbers B; satisfy the following recurrence relation: 


ki k+1 +1 
( EE ‘ EBEN S )a+1=0 Ga 


and for everyk > 1: 


k ki) 
(e+ a) = x1 = ( WE H Ju" 


dp (" S S B,X. (2.6) 


Proor. Formula (2.5) is easily obtained by considering the coefficients of 
the powers of X in the product of (e* — 1)/X and X/(e* — 1). 
For each n > 2 let 


U(T) = KIT + eT treff + eP- DT), 


where T is an indeterminate. 
The coefficient of T**! is equal to 





1 2% (n — 1} 
Te SECH sge- n. (2.7) 
On the other hand, 
ent — 7) T 
U(T) = kIT = ki x a vie — 1) 





=k! E = r) ( > 7 l (2.8) 


Comparing the coefficient of T**! in (2.8) with (2.7): 


k B, ykti-i 
E 
en (È, IFI- z} 


hence 


k+1 
K+ Dsn- = tt + i Lu Ban 


k+1 
tea S ) Bu 
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Since this holds for every n > 2 and S,(n) = n* + S,(n — 1), 


k+1 k+1 z 
a+ 050 = x -( + Jar + ( 5 laun 1 


1 
k+1 
ea ie Bx oO 
It is easily seen from 
-X X X 





X © B, 
ze =1 2k yk 
ef Se 1 = 2 a H k! 
that 

Borst = 0 for k = 1. (2.9) 


The numbers B; are called the Bernoulli numbers. They appear for the 
first time in Jakob Bernoulli’s posthumous work Ars Conjectandi (1713, 
page 97). Bernoulli computed B, for k < 10. 

Euler rediscovered the Bernoulli numbers, while calculating the formula 
for S,(n). He computed B, for k < 30. 

As will be seen, the Bernoulli numbers play a fundamental role in con- 
nection with regularity of primes, and so Ill return to them repeatedly. The 
algebraic and arithmetic properties of the Bernoulli numbers are quite 
fascinating and the literature about them is very extensive. 

One of the main theorems about Bernoulli numbers was communicated 
by von Staudt to Gauss. The proof was published by von Staudt, in 1840, 
just after Clausen published the statement of the theorem. This theorem 
gives the exact value of the denominator of any Bernoulli number. 


(2D) If k = 1, then 
1 
Bat % ed (2.10) 


p prime 
p-112k 


There are numerous proofs of this theorem, for example: Hardy and 
Wright give the proof by Rado (1934). 
Write 


Ba = Sn with Day > 0, réie, Dal = 1. (2.11) 
2k 


Then D,, is square-free and a prime p divides D,, if and only if p — 1] 2k. 
In particular, ol Da (for every k > 1). 
Moreover, it follows also that if p — 1|2k, then: 


pB,, = — 1 (mod p). (2.12) 
Another congruence which may be established is the following: 
(2E) For every integer m > 2, 
mN x = D24,S2,(m — 1) (mod m’). (2.13) 
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To make this precise, if p is any prime number, a rational number a/b 
(with a, b integers, b 4 0, gcd(a,b) = 1) is said to be p-integral if p does not 
divide b. If moreover, p divides a, then p is said to divide a/b. If a/b, c/d are 
rational numbers, then a/b = c/d (mod p) when p divides (a/b) — (c/d). The 
properties of such congruences are analogous to those of the ordinary 
congruences of integers. 

So to say that p divides the Bernoulli number B,, = N,/D,, means that 
p|N>, (and of course p X D,,, hence p — 14 2k). 

Though it is completely known which primes divide the dominator of 
Bernoulli numbers, it remains almost a complete mystery to predict which 
primes divide N,,. Yet, according to the next regularity criterion of Kummer 
(1850) this is precisely what matters. 


(2F) p divides h* (that is, p is not a regular prime) if and only if p divides the 


numerator of one of the Bernoulli numbers Bz, B,,..., By-3. 
Proor. By (2.13) (with m = p) and by (2A), p divides h* if and only if p? divides 
PN a, (where 2 < k < p — 3), that is, if and only if p| Na,- o 


The situation now has been much improved. It suffices to determine the 
Bernoulli numbers with indices up to p — 3, by means of the recurrence 
relation (2.5), and to determine whether their numerators are multiples of p. 
The only problem lies in the fact that as the index grows, so does the numer- 
ator of the corresponding Bernoulli number—in a rather dizzying way. 

For example 

__ 2577687858367 


34 7 6 


which is still manageable. However, the numerator of B,,, already has 250 
digits. And in conjunction with Fermat’s problem, the computations have 
been pushed much further ahead, requiring further developments of the 
theory of Bernoulli numbers. 

Right now, I only wish to say that Kummer discovered that 37, 59, 67 are 
the only irregular primes less than 100 (in 1850, 1851). Later in 1874, he 
showed that 101, 103, 131, 149, 157 are the irregular primes less than 164. 

For example 37|B3,,59| B44. As for p = 157 it divides both B,, and B410. 


3. Various Arithmetic Properties of 
Bernoulli Numbers 


As I have already intimated, a deeper study of the divisibility properties of 
Bernoulli numbers is essential, because the absolute value of their numer- 
ators grows very rapidly. 

This follows from a celebrated formula by Euler, connecting the Bernoulli 
numbers and the values of Riemann’s zeta-function at even positive integers. 
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First, Euler established the series development of the cotangent function 





(for z a complex number, z| < n): 
1 S 1 
mr wei 
æ x 2k 
re 


Because of the absolute convergence of the series, the order of summation 
may be interchanged, yielding: 


1 ES Eil 
cotz=--2) OO — (3.2) 
Zz ea T 
From this, it follows: 
_, 2(2k)! 
By, =(-1*"! On ¢(2k) (for k > 1). (3.3) 


Thus, knowing the Bernoulli numbers is essentially equivalent to knowing 
the values of the Riemann zeta-function at the positive even integers—a 
point which is worth stressing. 

Often quoted are the special cases: 


© 
È 
n=1 


For k sufficiently large, ((2k) may be easily computed to a high degree of 
accuracy. Since the fractional part of Bay is known, by the theorem of von 
Staudt and Clausen, B}, may be determined with an error less than 4. This 
method has been used in the past to build tables of Bernoulli numbers, but 
has its own limitations, since the series for {(2k) converges very slowly. 

From (3.3) it is seen that Bz, > 0 if and only if k is odd. It is also seen with- 
out difficulty that the sequence |B,,| is strictly increasing (for k > 4). 

By means of Stirling’s formula: 


n! ~ nen") (3.4) 


nt 


n? SC | 
zen AAT 


Ss RER 


it follows that 
k 2k 
|Bo,| ~ Ak (=) (3.5) 
me 
H 
and also that for every M > 1 
lim [Bas 
kra Ok)” 


All this shows how hopeless is the exact determination of the numerator 
Nax, except for rather small values of 2k. 

One of the most useful congruence properties of Bernoulli numbers was 
already discovered in 1851 by Kummer himself. He had the idea of con- 


= 00. (3.6) 
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sidering the function 


B 
2k) = == m 
f (2k) ak odulo p, 


defined for integers 2k such that p — 1 /2k. Note that in this case, by the 
theorem of von Staudt and Clausen, B,,/2k is p-integral and it is possible 
to consider its residue class modulo p. 

Kummer showed that the function f has period p — 1. Explicitly: 





Bogs p-1 — Ba, 


TEST modp). (3.7) 


This is nothing but a special case of the following more general con- 
gruence, also proved by Kummer: 


(3A) Let n> 1 and k be integers such that 2k > n +1. If p is an odd prime 
and if a is an integer, p X a, then 


n 2k+j(p—1) _ In 
GEO BrP = gmodp), 3.8) 





j=0 J 2k + j(p — 1) 
In particular, 
` {n\ Boy; 
17.) eV = 0 (mod p”). 
PA (Ja ee ES 


Concerning the divisibility properties of the numerators of the Bernoulli 
numbers, one of the earliest results is attributed to Adams (1878). In fact, 
he only proved a special case of a theorem previously stated by Sylvester 
(1861). The first proof of Sylvester’s theorem is due to Glaisher (1900): 


(3B) If p 4 Dar, if 2k = p'r, with t > 1, and if p Yr, then p'| No,. 
Letting B,,/2k = N4,/D, with D5, > 0, gcd(N4,,D%,) = 1, it follows: 
OCH If p is a prime, then pl Da, if and only if p|D%,. 
In 1845, von Staudt determined some factors of the numerator N>,. Let 
2k kb with gcd(k,,k) = 1 (3.10) 
such that p|k, if and only if p|D,,. von Staudt proved: 
(3D) With above notation: k, | Nax. 


This theorem however does not help to determine whether a prime p 
divides N,, for 2 < 2k < p — 3, since it only pinpoints factors k, < k. 

The following irregularity criterion appears already in Vandiver’s paper 
of 1932, though it is not stated explicitly as a lemma or theorem. It was found 
again by Montgomery in 1965; see also Carlitz, 1954. 
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(3E) A prime p is irregular if and only if there exists an integer k such that 
p divides Ny,/k, (where k, was defined in (3.10)). 


PROOF. As an illustration, I give this proof. If p is irregular, then there exists 
k, 1 < k < (p — 3)/2 such that p|N2,. But pX kı, so p|(N24/ky). 

Conversely, since p| N>,, then p ¥ Dax s0 p — 1 ¥ 2k. Let 2k = 2h (mod p — 1) 
with 2 < 2h < p — 3. By Kummere congruence B»,/2k = B,,/2h (mod p). So 


Nax 





2h k Dan = kyN 2,D ax (mod p). 
1 
Since p ¥ Dax, p X k2; by hypothesis, p| N, and so by Kummer’s regularity 
criterion, p, is an irregular prime. o 


4. The Abundance of Irregular Primes 


In 1847 Kummer did not know whether every prime is regular. But in 1850 
and 1851, he discovered that 37, 59, 67 are the only irregular primes less than 
100. In 1874, he extended his computations up to 164. At that time and based 
on probabilistic arguments (which he himself regarded as doubtful), Kummer 
advanced the conjecture that asymptotically there should be as many regular 
as irregular primes. 

Today, with extensive tables, it has been observed that the ratio 


number of irregular primes less than N 
number of primes less than N 





is about 0.39 when N is large. I’ll return to this point at the final part of this 
lecture, in connection with a heuristic prediction of Siegel. 

Despite the observed plurality of regular primes, it has not yet been shown 
that there exist infinitely many regular primes. 

On the other hand, quite surprisingly, Jensen proved in 1915 that there 
are infinitely many irregular primes. I give below a proof which is due to 
Carlitz (1954): 


(4A) There exist infinitely many irregular primes. 


PROOF. Let p,,..., Pm be irregular primes. Due to the growth of |B>,| with k, 
there is an index k such that 2k is a multiple of (p, — 1): (Pm — 1) and 
|B.,| > 2k. Let |B2,|/2k = a/b with a> b> 1, gced(a,b) = 1. 

Let p be a prime dividing a. I show that p / D2,. Indeed, from 


[Nab = 2kD2,a and pla, pyb 


it follows that p| Nz% so PX Dox. 
Let 2k = k,k, as in (3.10). Then by (3D) 
N 
Nas b = k Daxa. 


1 


5. Computation of Irregular Primes 107 


It follows that p divides Nax/kı. By the Vandiver and Montgomery irregu- 
larity criterion, p is irregular. Since p — 1 ¥ 2k (by the theorem of von Staudt 
and Clausen) p is distinct from p4, .. , Pm- CT 


As a matter of fact, Jensen had proved more, namely: 


(4B) There exists an infinite number of irregular primes p such that p = 3 
(mod 4). 


His ingenious proof, written in Danish, was made a available in English 
by Vandiver in 1955. 

Jensen’s result was the object of several generalizations. In 1965, 
Montgomery proved that if m > 2, there exist infinitely many irregular 
primes p such that p 1 (mod m). 

In 1976, Metsänkylä published the following result, which had been 
obtained independently in 1975 by Yokoi, for m prime: 


(4C) Let m > 2, let G be the group of invertible residue classes modulo m. 
If H is any proper subgroup of G, there exist infinitely many irregular primes 
p such that p modulo m is not in H. 


Up to now it is not known whether, given m > 2, there exist infinitely 
many irregular primes p such that p = 1 (mod m). Montgomery conjectures 
that this is indeed true. But the only available result in this direction is due 
to Metsänkylä (1971): 


(4D) There exists an infinite number of irregular primes p such thatp=1 
(mod 3) or p = 1 (mod 4). 


5. Computation of Irregular Primes 


To determine whether a prime p is irregular requires the computation of the 
residues modulo p of the Bernoulli numbers B,, for 2 < 2k < p — 3. Due to 
the size of the numbers involved, it is essential to derive congruences which 
reduce the amount of calculation. 

A very general congruence for the sums S,(n) was discovered by Voronoi 
(1889). It is reproduced in the book of Uspensky and Heaslet (1939) and it 
was rediscovered by Griin in 1940. 


(SA) Let m>2,k>1,a> 1 and let gcd(a,m) = 1. Then 


ml ; 
(a** — 1)S,,(m — 1) = 2kma?*! 2 pe H (mod m?) (5.1) 
iz 


and 


m-i € 
(a?® — 1)N, = Daikaih) 2 pe | (mod m). (5.2) 


J 
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One of the various applications of Voronoi’s congruences is to derive a 


simple proof of the following congruence of Vandiver (1917, and an easier 
proof in 1937): 


(5B) Let p be an odd prime, k > 1, anda> 1 hth = 
not divide 2k. Then = I, such that a < p and p — 1 does 


a-1 ; 
DEN Sapi (2) (mod p). (5.3) 
j= a 
I will now show how Stafford and Vandiver derived in 1930 some in- 


teresting congruences for B,,. They are very useful in determining whether 
a given prime is regular. 


(SC) If p is a prime, p>7,k > 1, and p — ¥ 2k, then 
A5 2k $ 2p 2k E 6P 2k al 
E Be A RT (modp). (5.4) 


pl6<j<p/4 





Note that the sum in the right-hand side is onl j 

Not y for j between p/6 
which is a relatively smali range. í a 

In 1954, Lehmer, Lehmer, and Vandiver made use of (5.4) to determine 
whether a given prime p is regular. By Kummer’s regularity criterion, this is 
equivalent to p ¥ By, for 2 < 2k < p — 3. If p does not divide 


So, = 3 ar 


PI6<j<p/4 
then also p 4 B,,. Again, if p|S2,, but p does not divide 
6?” 2k _ qp-2k _ 3p-2k 1 1. then p| Boy. 


No conclusion may be drawn if p|S., and p|6?~?* — 4P-2k _ 32-2k 4 1, 
In this situation, the same method may be applied (up to now with success) 


to es the following congruences, which were also proved by Vandiver 
in : 


(SD) If p is a prime, p>5,k> 1, and p — 12k, then 
gp 2k + 3PT2k Py 4p-2k des 1 
rf Bu= YP (modp. (55) 


pl4<j<p/3 





(SE) If p is a prime, p > 11,k> 1, and p — 1} 2k, then 


4P=2k 4 5p-2k_ gp-2k__ 4 ;2k-1 
S Ba= E PAIR E rt modp) 


pl8<j<p/5 3p/B<j<2p/5 





(5.6) 


l The evaluation of the residues modulo p of the sums in the right-hand 
side is laborious, but may still be carried out by computer for quite large 
values of p. 
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None of the above tests is theoretically assured to work. 
E. Lehmer proved in 1938 the following congruence: 
O p72 
(p — 2j)" = 2-1 pB, (mod p°) (5.7) 
j=1 
which is valid when 2k # 2 (mod p — 1). 

Carrying out the computation of the sum modulo p°, it it is congruent to 0, 
then p| Ba, and conversely. The major inconvenience with the above sum is 
that j runs from 1 to (p — 1)/2. 

After the original computations of Kummer, the irregular primes up to 
617 were determined with desk calculators by Stafford and Vandiver (1930) 
and Vandiver (1931, 1937). 

With the advent of electronic computers this program was extended by 
Lehmer, Lehmer, and Vandiver (in 1954) up to 4001, and then by various 
authors (Selfridge, Nicol, Pollack, Kobelev, and Johnson) up to 30000. 
Finally in 1976 with the IBM 360/65 and 370, Wagstaff determined all the 
irregular primes less than 125000. 

In the recent computations, especially those of Johnson and Wagstaff, 
much more numerical data has been accumulated, which I will now explain. 

If p is a prime, if 2 < 2k < p — 3 the couple (p,2k) is called an irregular 
couple if p| Boy. 

The index of irregularity of p, denoted ii(p), is the number of irregular 
couples (p,2k), where 2 < 2k < p — 3. Thus p is regular when ii(p) = 0. 

For every N > 1 and r > 1, let 2,(N) = # {p < Nlii(p) = r}. Let also 


n'(N) = #{p < N|pis irregular} 


and 
n(N) = #{p prime, p < N}. 


I summarize the results obtained by Wagstaff, which he kindly com- 


municated to me: 
For N = 125000: 





r{N) = 11734 
r(N)= 4605 
m,(N) = 3559 
nm (N)= 875 
m3(N) = 153 
nN) = 16 
m5(N) = 2 
n,(N) = 0 forr> 6. 
Moreover 
"UNI 
SON) = 0.39248. 


All the irregular couples (for p < N) have been completely determined. 
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Two pairs of “successive” irregular couples were found: (491, 336), (491, 338) 
and (587, 90), (587, 92). 

Ifh > 1, no successive irregular couples (p,2k), (p, 2k + 2),...,(p,2k + 2h) 
were ever found. 

Such occurrences of successive irregular couples turn out to be important 
in the light of other theorems which I will explain in a later lecture. 

No prime p was found such that p? divides some B;.. 

Among the open questions, I single out: Given s > 1, do there exist 
infinitely many primes p such that ii(p) = s? Similarly, if s > 2 do there exist 
infinitely many primes p such that ii(p) > s? This is only known to be true 
for s = 1 (Jensen’s theorem). 

According to a heuristic argument of Siegel (1964) 

m BE 
N>% m(N) Je 


This is very close to the observed distribution of irregular primes. His argu- 
ment may be extended and yields: 


— lim m(N) 1 
va SIN)  2"rla/e 

This also agrees very closely with the observed values. 
In a letter to Serre, Shanks wrote (3 December, 1975): 





P 








(for r > 0). 


r 


... one does have 
Is A P 
k=0 
as one should. But this implies, and I do not know whether anyone has noted 
it earlier, that 
1 OO 
=) kP,. (5.8) 
2 Zo 
So, among the z(N) — 1 odd primes <N one should expect about 
3(n(N) — 1) 


pairs of irregular primes and the Bernoulli numbers they divide [ = irregular 
couples]. Here is recent data of Wagstaff for N = 10°, x(N) — 1 = 9591 


k primes of index k 


0 5802 0 x 5802 
1 2928 1 x 2928 
2 728 2x 728 
3 123 3x 123 
4 8 4x 8 
5 2 5x 2 

9591 4795 


4(9591) = 4795.5. Mama Mia! 
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Actually with N = 125000, the coincidence is not so accurate: 


11733 
2(n(N) — 1) =.= = 5866.5 
while 1 x 3559 + 2 x 875+3x153+4x 16 + 5 x 2 = 5842. 
Yet it would be worth while to prove (5.8) since “it would imply that 
F.L.T. is true for at least one-half of the primes.” 
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LECTURE VII 


Kummer Exits 


After having established Fermat’s theorem for regular prime exponents, 
Kummer continued his work, considering the first case for arbitrary prime 
exponents. He was able to derive congruences, involving Bernoulli numbers, 
which must be satisfied by any would-be solution. From these congruences, 
he derived specific divisibility properties about Bernoulli numbers. 

The results of Kummer were expanded and given another form by 
Mirimanoff, and more recently, extended considerably by Krasner. I will 
explain these more recent results in my next lecture. 

As in his preceding work, Kummer began by breaking new ground in the 
theory of cyclotomic fields. This is what I will consider first. 


1. The Periods of the Cyclotomic Equation 


I will use the following notations: 
p = odd prime 
€ = primitive pth root of 1. 
D(X) = XP} + XP? +--+ X + 1 cyclotomic polynomial 
A=1-¢ 
K = Q(9 = cyclotomic field, of degree p - 1. 
A = Z[¢] = ring of cyclotomic integers 
g = primitive root modulo p 
G = Gal(K |Q) = Galois group of the cyclotomic extension 
oi — [? generator of G 


Gauss introduced the periods of the cyclotomic equation. If f, r are 
integers, p — 1 = fr, the r periods of f terms (relative to the primitive root 
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g modulo p) are the cyclotomic integers ge, 41, ...,4,—, defined by 
noe Ota car ++. + 


= W + cre + air? +e gr 
me ° ° (1.1) 


2r-1 


e DT E SEE TE 


Note that 
r-1 
GÉIE) (1.2) 
j=0 


For convenience, if j = jo (modr), 0 < jọ <r — 1, I also define 4; = fj 
The periods are conjugate to each other: 


o(n;) = Her (1.3) 


In particular, o(n)=n; (for i= 0,1,... x — 1). So the fields no), 
Oil... Q(y,- 3) are conjugate over Q. But K|Q is a cyclic extension, so 
Ay) = OD, 1). 

This field will be denoted by KI It is the subfield left invariant by the 
subgroup of G generated by o”, and [K:K'] = f, [K :Q] =r. 

Let A’ be the ring of integers of K'. 

In his papers of 1846, 1847, and 1857, Kummer proved the following facts: 


(1A) {nos .. A1) is an integral basis of A’; in particular A’ = 
Z[ no -< M-11- 


However, it is not true, in general, that Z[no] =: = Z[y,-1] = A’. 
Secondly, he showed also: 


(1B) A is a free A’-module with basis {1,6,... ,C4—"}. That is, every algebraic 
integer a e A is, in unique way, of the form 


f-1 . 

a= A ail, 
j=0 

where a; € A' (j = 0,1,...,f — 1). 
Let 
r—i 
F(X) = II X-n)eZz[X]. (1.4) 
i=0 


Kummer studied the congruence 
F(X) = 0 (mod q), (1.5) 


where q is any prime. 
I should say, at this time, that in his paper of 1846, Kummer made a 
mistake in his study; however, in 1857, he gave a correct proof the theorem 
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which follows (see comments by Weil, in volume I of Kummer’s Collected 
Works). In 1975, Maury presented a corrected proof of Kummer’s result in 
today’s language. 


(1C) Let q be any prime, let fr = p — 1. If Q is any prime ideal of A dividing 
Aq, then for every period n, (0 < k <r — 1) there exists a unique integer u,, 
0 < ug < q — 1, such that Ny = u, (mod Q). In particular 

r-i 


F(X) = L (X — u) (mod q). (1.6) 


1 wish to stress that the numbers uo, u,,..., 4, 1 need not be distinct, for 
example, they are certainly not when q < r. 

If q is a prime different from p, then it is unramified in K, that is, Aq is the 
product of distinct prime ideals: 


Aq = Get "Ou (1.7) 
Let fr = p — 1, let yo, 41,---, Ny- be the r periods of f terms and K’ = 
On) = = Q(y,- 1), A = Zn. Nr- 1]. Then A'q is also the product 
of r distinct prime ideals in A’, each having degree 1: 
Aq = OO Oa 
q = 2004 Q,-1 (1.8) 


A'lQ = Fa (i= 0,1,... xr — 1). 
The numbering is such that 
AQ; =Q; (fori=O,1,...,7— 1). 


2. The Jacobi Cyclotomic Function 


Another tool in Kummer’s research was the generalization by Jacobi of 
Gauss’s and Lagrange’s resolvents. These were quite essential in studying 
the question of solution by radicals. 

Besides the notation already introduced, J shall fix the following: 

Let q be an odd prime, q # p, and 


h = primitive root modulo q, 
p = primitive qth root of 1, 
0 = primitive (q — 1)th root of 1. 


If t is not a multiple of q, there exists a unique integer s, 0 < s <q — 2, 
such that t = h (mod q). s is called the index of t, relative to h, and denoted 
s=ind,(t), or simply, s=ind(r). Thus ind(1) = 0, ind(—1) = (q — 1)/2. 
Clearly, rer (mod q), then ind(t) = ind(#’). Also 


ind(tt’) = ind(t) + ind(t)(mod q — 1)when q Fr, 
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The Lagrange resolvents are sums of the form: 
es? u i a1 ie 
H Go = X Griet. (2.1) 
i=0 t=1 

Jacobi considered in 1837 (in a paper reprinted in 1846) the analogous 


sums when q = 1 (mod p). Let q = pk + 1, so k is an even integer. If m, n are 
integers, p Yn, q X m, the Jacobi resolvents are 


g-i . q-2 S 
E E er X ER indn() gmt se X Ce? , (2.2) 
t=1 u=0 


Clearly <¢",p"> belongs to the ring of integers of Q(¢,p). 
The Jacobi cyclotomic functions are defined as follows: If I, n are integers, 
pn let 


q-2 


Ye) = 5 grinda) + 1)ind,(t+ D (2.3) 
t=1 


Then y,(¢") is an integer of Q({). 

Note that @(¢,p) is a Galois extension of Q(¢). Let ebe the automorphism 
of Q(¢,p) such that F(t) = C9, F(p) = p. 

DU now list various properties of these sums and of the Jacobi cyclotomic 
functions. 


(2A) If p¥ n, qm, then 


a. Kr, = Cent "ail, 
b. <¢,p>? and o<C,p>/<C,p>? belong to OC) 
c. Goin oz q. In particular <¢,p> # 0. 
d If lis any integer such that p l+ 1, then 
_ <6 ><C,p> 
MANY COPE gy. 


e. (Cp)? = aW (EWE) "aal 
f. If1=1,2,...,p— 2, then WOYE) = a. 


The proof of these properties is of course somewhat laborious, but involves 
no essential difficulties. Most useful is the expression of (£,p)? given in (e). 

The main result of Kummer concerns the decomposition into prime ideals 
of the ideal of A = Z[{] generated by <¢,p>’. A preliminary result is the 
following: 


(2B) If qg = kp + 1 and if Q = Aq + A(h* — 0), then Q is a prime ideal with 
norm q and Aq = | [Pze o'(Q). 


Now let z = (p — 1)/2 and for every integer k let g, be the unique integer 
such that 1 < g, < p — 1 and g* = g, (mod p). If k < 0 this is to be understood 
as gg "= 1 (mod p). 
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In 1857, Kummer proved: 


QC) If 1<d<p-—2, let 


I, = {il0< i< p — 2,9,-i + Gn-itinayia) > P} (2.4) 
Then: j 
Awdo) = Il >O) (2.5) 
and iela 
p72 
ACép»? = I [oQ]. (2.6) 


This was the main basic result of Kummer concerning the Jacobi re- 
solvents. It was generalized by Stickelberger in 1890, who allowed any 
natural number m > 3 in place of p, with q prime, q = 1 (mod m). 

As an immediate corollary: 


(2D) If Q, is a prime ideal of degree 1 of K = Q(Q), then 
p-2 
H Leto]: 
i=0 
is a principal ideal. 


The virtue of the above result is to produce a principal ideal. 


3. On the Generation of the Class Group of 
the Cyclotomic Field 


In 1847, Kummer proved a theorem about the generation of the class group 
of the cyclotomic field. Later, with analytic methods, Dirichlet strengthened 
Kummer’s result in a very substantial way, by proving the density theorem. 
From this result, he was able to prove the celebrated theorem on primes in 
arithmetic progressions. 

Kummer’s results were: 


(3A) The group of ideal classes of the cyclotomic field is generated by the 
classes of prime ideals of degree 1. 


As I said above, much more is true, but I'll comment later about it. 

With this result and all those indicated before, Kummer proved the 
following result, which enabled him to produce principal ideals from any 
given ideal J: 


(3B) If í < d < p — 2, for every ideal J of A, the product Tier. oJ) is a 
principal ideal. 
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To convey an idea how the preceding results unite to establish this one, 
I will briefly sketch the proof. 


PROOF. By (3A), J is in the same ideal class as a product of prime ideals Q; 
of degree 1: 


Let N(Q,) = q,, so q; = 1 (mod p) because the inertial degree of Q; is the 
order of q; modulo p. Let q; = k;p + 1 and let Q¥ = Aq, + A(hj — 0), where 
h; is a primitive root modulo q;. By (2B) 

p-2 


Aq; = TT o(Q#) 


i=0 


so Q; is conjugate to Qt, say Q* = o"(Q,), where 0 < j* < p— 2. 
By (2.5) 


II stop = o"( TI oto) 
iela iela 
is a principal ideal of A. Hence 
I au = A(T] sel D (Tot) 
iela iela J=1 \iela 
is also a principal ideal of A. oO 
The strengthening of (3A) done by Dirichlet asserts that in every ideal 
class there is, not only one prime ideal of degree 1, but in fact, infinitely many 


such. Even more, these ideals have a positive density. But Pll not elaborate 
on this point. 


4. Kummer’s Congruences 


Having proved Fermat’s theorem for regular exponents, Kummer next 
considered arbitrary exponents. Initially, he dealt only with the first case. 

A preliminary result in the line of Kummer’s research was obtained by 
Cauchy in 1847. Let p be an odd prime. 


(4A) If p does not divide the sum: 


SE — Den 
JS: Jawa arte (27) 3 


then the first case of Fermat’s theorem holds for the exponent p. 
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As Genocchi pointed out in 1852 (and again in 1866), the above sum is 
related to a Bernoulli number: 


—2B,_, = JE: d (mod p) (4.1) 





and Cauchy’s result may be rephrased: 


(4B) If the first case of Fermat’s theorem fails for the exponent p then p 
divides H. — 


Kummer extended this criterion, as a consequence of his congruences. 
This is what I shall now explain. 
Assume that x, y, z are relatively prime integers such that p Y z and 


xP + yP + 2P = OU 42 
Then H Si 


(Az)? = A(x? + y’) = T A(x + Cy), 
k=0 


As I have indicated in my fifth lecture, prior to proving Kummer’s main 
theorem, there exist pairwise relatively prime ideals L, J of A such that 


oa (43) 
A(x + Cy) = 
Then 
A(x + Cy) =(o*(J))? fork =0,1,...,p — 2. (4.4) 


Taking norms in the extension Q(£)|Q, |x + y|?~' = P, where N(L) = |. 
Then x + y has to be the pth power of an integer, x + ve t?, and L = At. 
By (3B), for each d, 1 < d < p — 2, the ideal Į ];.,, o'(J) is principal, say 


iela 


= [| Ax +y) 


iela 


Hence 


so there is a unit e of A such that 
eah = [] (x + Cy). (4.5) 
iela 
Let I; = {il0 < i < p — 2,i¢ la}. It is easy to see that i’ € I, if and only if 
i’ =i +n (mod p), where ie I, and z = (p — 1)/2. Hence taking complex- 


conjugates ; 
wag = [| (x + Sy). (4.6) 
ielg 


Multiplying (4.5) and (4.6): 


-Y= Wie + Cy) = [I xe + iy) TI te + Chy) = lato)”. 


iela iela 


Hence & is the pth power of a unit o gë = œ”. 
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Since e = (5, where 6 is a real unit, € = (~"5 so œ” = ô? and therefore 
ô = (w"d*)?, where sp + 2r = 1. 
In conclusion, l 
[] ( + oy) = "aP (4.7) 
iela 
with a = œ ao € A. 
Since «€ A there exists a polynomial F(X) e Z[X], of degree at most 
p — 2, such that « = F(¢). Hence the polynomial 
II (& + X"y) - run 


iela 


vanishes at £. So it is a multiple of the cyclotomic polynomial ®,(X) and I 
may write 
[] (x + X%y) = X"(F(X))? + AXEL (4.8) 


iela 


where M(X)e Z[X]. 
Look at the corresponding polynomial functions of the positive real 
variable t. Then 
[I (x + ty) = FOY + POMO. 
iela 
Since t > 0, it may be written as t = e”, where v an arbitrary real number. 
Hence 
[I (x + ey) = Fle) + ®,(e’)M(e’) 


iela 


Je lte) 
= e”(F(e’))?| 1 + <. 49 
e™(F(e’)) | + Fe) (4.9) 
Note that x, y and the coefficients of M(X), F(X) may well be negative 
integers. So, taking the logarithms of both sides of (4.9) it is necessary to 
consider the complex logarithmic function; this leads to an equality up to 
some multiple of 2zi, which will be irrelevant, after taking derivatives: 
A log(x + ey) = mv + plog Fle’) 
iela 
Ae Wie? 


| + 2kni (wherekeZ). (4.10) 


+ EI + 


The following notation will be used. If G(v) is a differentiable function of 
v, let D"G(v) be the value of d"G(v)/dv” at v = 0. 

One might wonder, at this point, how Kummer could untangle himself 
and still reach some worthwhile conclusion from (4.10). But, with his mastery, 
nothing should astonish us anymore. What he succeeded in proving (1857) is: 


(4C) Assume that p is an odd prime, x, y, z are relatively prime integers, p 
does not divide z and x? + y? + z? = 0. Then the following congruences are 





4. Kummer’s Congruences 123 


satisfied: 
[D?~**log(x + e’y)]B,, = 0 (mod p) 


for 2s =2,4,...,p—3. 
Similar congruences hold when x and y are interchanged. 


ProorF. Since the proof of this theorem is quite long, I have to restrict myself 
and only provide a sketch of its main points. 
(a) Ifn=2,...,p—2, then 


3 D"log(x + ey) = 0 (mod p). (4.11) 
iela 


For this purpose, one computes the value at v = 0 of the nth derivative 
of the right-hand side of (4.10). For example 


P 


D"(p log F(e’)) = Fay 


H(FÜ),F(),...,F®(1)), 
where H is a polynomial with coefficients in Z and p does not divide F(1), 
as may be seen without difficulty. 
For the last term one proceeds in a somewhat similar manner. 
(b) [D" log(x + e’y)](Sier, 9") = 0 (mod p). This follows at once from (a). 
Let S = Änt, 9 "H. In order to evaluate the residue of S mod p, various 
facts have to be proved. 
(c) 
1 1 whenie Ij, 
plant + Gr +indg(d)—i ~ Jn +indg(d+ D-i] = d when i¢ l. 


Indeed, the number in brackets is congruent to g’"{1+d-(d+1))= 
0 (mod p). From this, it is easy to establish the assertion. Hence 


p-24 
= -29i 
S= 3. plIn-: + In-itinasin — Ir-i+indgld+ vig” TT. 
i=0 


Since of = g (mod p), 


pS = S, + Sa — Sarı (mod p°), (4.12) 


where 
p-2 


S= H Ir it, (4.13) 


i= 


(d) 
pP 72 . 
S; = — jr(p~ 2s) > og. oi" Zelt (mod p°). 
i=0 


This is easy to prove. It follows from (4.12) that 


p-2 
pS =—[1 + dw 29 — (d + ll È eer (mod p?). (4.14) 
i=0 
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Writing g-; = k, the above congruence, becomes 
p-1 
pS = —[1 + dP? — (d+ pee H vc (4.15) 
k=1 
Let 1 + p(2s — 1) = 2m. Then, as I have indicated in Lecture VI, 


1 
kre) Bad 


=1 
1 msi, fam+l 2 2m+1 2m-1 
ii d 1 JOP t a jr 


2m +1 2m +1 
pl. (4.16) 
4 2m 


From the theorem of von Staudt and Clausen, the denominator of a 
Bernoulli number is square-free. Hence 


Saml P 7 1) = BomP (mod p°) 


p- 
k 





(4.17) 


and 
S= —[1 + d?” — (d + 1)?" ?*]B,,, (mod p). 


But 2m = 2s + (p — 1)(2s — 1). By Kummer’s congruence (explained in 
Lecture VI, Formula (3.7)) 


Bom = Bas (mod p). 
s 


(4.18) 





Hence 





B 
Bas _ Bas (mod p). 
2s 


5 = (4.19) 


Ba, = [1 + pQs — 1)] 
From (b), written for n = p — 2s (2s =2,4,...,p — 3) and (4.18), (4.19), 
it follows that 


[D?-*log(x + em + ar — (d + Dr] = 0 (modp). (4.20) 


(e) Conclusion. The polynomial 1 + XP”2° — (X + 1)?” 2° is not identically 
zero and has degree p — 2s — 1 < p — 2. So there exists d, 1 < d < p — 2, 
not satisfying 

14+ X77 - (X + 1)?" = 0(mod p). 


From (4.20), it follows that I may divide by 1 + dë 
this choice of d), hence: 


(d + 1)?~25 (for 


B 
[D?~**log(x + e’y)] ES = 0 (mod p). (4.21) 


From this, the congruences of Kummer follow immediately. oO 


If x? + y? + z? = 0 and p does not divide x, y, z, then the congruences of 
Kummer hold also for the pairs (x,z), (z,x), (y.Z), (z,y). 








D 


ee el E deg e ee Te emm ele Te DÉI 
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By computing the derivative explicitly, it is easily seen that 


EEN 
(x + yy" 
where R,(X,Y) is a homogeneous polynomial, with coefficients in Z, of 
total degree j, which is a multiple of Y. 

Moreover, it has the following properties. If j> 1, then R,(Y,X)= 
(—1)/R,(X,Y) and if j is odd, j > 1, then XY(X — Y) divides R,(X,Y). 

Writing Y = XT and P,(T) = R,{1,T) gives R(X,Y) = X/P,(T). 

With these notations, Kummer’s congruence may be easily rewritten as 
follows: 


(4.22) 





Di log(x + ey) = 


(4D) If x? + y? + 2? =0, pf xyz, for 2s = 2,4,..., p — 3, and 


then 
Pp-2s(t)B2s = 0 (mod p). 


This is a more useful form of Kummer’s criterion; instead of derivatives, 
it makes use of the values of certain polynomials. 

In order to apply this criterion, it is important to compute these poly- 
nomials. For example, for small indices: 


P(T)=T, 

P(T)=T, 

PT) = T(1 — T), 

P(T) = TU — AT + T’), 

P.(T) = T(1 — TJA - 10T + T?), 

P(T) = T(1 — 26T + 66T? — 26T? + Tä 


The first application by Kummer of his criterion was the extension of the 
results of Cauchy and Genocchi: 


(4E) If x? + y? + 2? = 0, with p X xyz, then p divides both B,- and B,-s. 


Proor. Assume that R3(a,b) = 0 (mod p) where a, b are distinct, and a, be 
{x,y,z}. Then P(t) = 0 (mod p), where t = a/b (note that p% xyz). Hence 
pla — b, that is, a = b (mod p). For the various choices of a, b, this yields 
x = y =z (mod p). Hence 


3x =x+yt+z=xP +y’ + 2? = 0(modp) 


and because p + 3, it follows that p|x, against the hypothesis. Thus, for some 
a, b necessarily P3(t) # 0 (mod p). So, taking 2s = p — 3, it follows from (4D) 
that B,_3 = 0 (mod p). 

The proof that B,_s = 0 (mod p) follows similar lines, and is just a little 
bit more involved. Oo 
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There have been various proofs of other criteria, analogous to those of 
Kummer’s. See, for example, Vandiver (1919). In 1922, Fueter was able 
to again prove Kummer’s congruences using only methods developed in 
Hilbert’s Zahlbericht. Let me add that this didn’t make matters any easier! 


5. Kummer’s Theorem for a Class of 
Irregular Primes 


In his memoir of 1857, Kummer proved a theorem which establishes the 
truth of Fermat’s last theorem for certain irregular exponents. To arrive at 
this theorem, Kummer investigated thoroughly the structure of the group 
of units of the cyclotomic field. He also used A-adic logarithms. However, 
while Kummer’s results were ultimately correct, there were several important 
gaps and mistakes in his proof. These were pointed out initially by Mertens 
in 1917. Vandiver clarified the doubtful points and provided rigorous proofs 
(1920, 1922, 1926, 1926). 
I keep the same notation as before. If « e A, then a= a) + a, +++- + 
de. SCH" with aq; € Z. 
Associated with q, let at) be the function of the real variable x, defined by 
a(x) = Ag + ayX + + ay kel 


Thus a(f)=a, al)=a+a,+'''+qa,-.. Since A=1-[L, «= 
a(1) (mod Az). Ifa ¢ AZ, then a(1) is not a multiple of p. 
Let v, be the A-adic valuation of K; it is the one associated to the prime 


ideal AA. Then 
a 
1 —-——j)>1 
d in) z 


Extending v, to the completion R of K (with respect to the topology 
defined by v,) Kummer considered the /-adic logarithmic function (1852): 


a u Ké (1771 a n 
(an) Ee Lë ES 


Eisenstein also had considered the å-adic logarithmic function in 1850. 





In 1851, Kummer also considered the function / (for s = 1,2,...,p — 1) 
defined as follows. If t is a real variable, oe A\AA, then: 
e t 
me = [eee] Së 
dr t=0 


I®(x) is a p-integral rational number. 


The generalized concept of index was introduced by Kummer in 1852. 
Let q be a prime, q # p, let Q be a prime ideal of A dividing Aq, with norm 
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N(Q) = al. Let « e A\Q. Then it is clear that x17! = 1 (mod Q). Moreover, 
there exists a unique integer c, 0 < c < p — 1, such that 


aa "e = Ce (mod Q). (5.3) 


c is called the index of x and denoted indg(«) = c. 
The pth residue power character modulo Q is defined (for a e A\Q) by: 


‘at = Cado, (5.4) 


Gr = ail DIP (mod Q). 


Among the properties, I note: 


1. Ifa, ße A\Q and « = ß (mod Q), then {«/Q} = {8/0}. 
2. Ifa, ße A\Q, then {%B/Q} = {a/Q}{B/Q}. 

Moreover {«/Q} = 1 if and only if there exists ße A\Q such that «= 
B? (mod Q). 

In 1852 Kummer generalized the Jacobian cyclotomic functions. First, he 
noted that if £ is a given primitive pth root of 1, there exists p, a primitive 
root modulo Q (that is, the residue class of o modulo Q is a generator of the 
multiplicative group of the residue class field A/Q) such that 


{= pe wP (mod Q). 


Let q be an odd prime, q # p. For every k = 1, 2,... 
integer j, let 


,p — 1 and every 
Te kl ++ (5.5) 
(ri — [ind g(t + )-G+ DA S 
WA" È ¢ 
t#(qf-1)/2 
Ifq = 1 (mod p), that is, if f = 1, these functions are essentially the Jacobi 


cyclotomic functions, as defined in this lecture. 
Mitchell extended this definition in 1916 to also include the case where 


. CH the properties of the Kummer cyclotomic functions, I mention: 
GIN) (5.6) 
If, j #0, — 1 (mod p), then 
GIG EE (5.7) 
If f is even, then 
vd = q”. (5.8) 


If f is odd, then 


YE) = £Q”o(Q)™ --- ai" pc), (5.9) 
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where rf = p — 1, c is the automorphism of K such that o(f) = C9, g a primi- 
tive root modulo p, 


m, = Bus + Sitp-1)-i+ indgi = Sa p-1)~itindg(j+ 1) 


1 
Sk = 5% + rt tigen) 








and 
gx = 9'(modp), 1t<g,<p-1. 
Let 
1- g = =g 
EE 
1-¢€ 1-¢ 


be one of the Kummer circular units (= Kreiseinheiten). Let 


Vai) 
6,= M a") (5.10) 


i=0 


for s = 1,2,...,(p - 3)/2. 
In 1852, Kummer computed the index of 0, in terms of the cyclotomic 
functions: 


(5A) If s =1,2,...,(p — 3)/2 and if 1 <j < p -— 1, then 


eil 1 
x 
2 1+ jP 47 — (f+ 1)? 





indg(0,) = Ir" ig (C) (mod p). 


Also, if f does not divide p — 2s, then indọ(0,) = 0 (mod p). 

Using this arsenal, Kummer studied Fermat’s equation under certain 
working hypotheses. 

Assume: 


I*. The first factor h* of the class number of K is divisible by p but not by p°. 


The following result was assumed tacitly by Kummer; its first complete 
proof is due to Vandiver (1922): 


(5B) If Hypothesis (I*) is satisfied, then: 


I’. There exists a unique index 2s, 2 < 2s < p — 3, such that p|B ,; more- 
over p° X B,,. 


Next, Kummer assumed: 


II. There exists an ideal J of A such that the unit 6, [corresponding to the 
index s of (I’)] is not congruent modulo J to the pth power of any element 
of A. 
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He proved: 


(50) Letq be an odd prime, q # p, let Q be a prime ideal dividing Aq, and 
assume that OR = Aa (his the class number of K). Under Hypothesis (I’) and (II): 


(- Ir g?s Se 1)Bz2ps-p+ 1 
2h 





indọ(0,) = x Ir" 25(q) (mod p), 


where s is defined in Hypothesis (1'). 


If J = ine ı Q; (where each Q; is a prime ideal not dividing Ap), let ind,(«) = 


AN, indg,(a). 
With this definition: 


(5D) If OD) and (II) are satisfied and if J is any ideal whose ideal class has 
order (in the ideal class group) not a multiple of p, then ind,(0,) = 0 (mod p). 


(5E) If (V) and (II) are satisfied and if J is any ideal such that J? = Aa, 
a principal ideal, then J itself is a principal ideal if and only if 
Ip" Zeite) = 0 (mod p), 
[where s is defined in (UL 


From all these results, Kummer concluded: 


(SF) If (U) and (II) are satisfied, then p does not divide the second factor h* 
of the class number. 


(5G) If (I') is satisfied and if p divides h* , then the unit 0, [ for the index s 
defined in (1’)| is the pth power of a unit of K. 


Kummere proof of the next theorem was incorrect. But, thanks to 
Vandiver’s work, (1926), it is now rigorously established: 


(5H) If OU) is satisfied, if « e A\AA and a is real, and if (a) = 0 (mod p), 
then there exists a unit e such that ca = m (mod p), where me Z. 
Let 


DL Ifs is given by (UL then B,,, is not a multiple of p°. 


The following result was again put on firm ground by Vandiver: 


(SD) If OU) and (III) are satisfied, if p does not divide h*, and if e is any unit 
of K congruent modulo Ap? to an integer me Z, then there exists a unit Eo 
such that ¢ = eh. 
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With these preliminary results, Kummer finally proved: 


(5J) If the conditions (I*), (IT), and (III) are satisfied, then Fermat’s theorem 
holds for the exponent p. 


This theorem applied to the irregular prime exponents p = 37, 59 and 67. 
He verified by actual computation that the conditions (1*), (ID) and (IH) were 
satisfied. 

I should add that in 1893, Mirimanoff gave an independent proof of 
Fermat’s theorem for p = 37. So, even discounting the inaccuracies of 
Kummer, Fermat’s theorem was established without any doubt for the 
smallest irregular prime. 


6. Computations of the Class Number 


Kummer had made, all through his life, extensive computations of the class 
number of cyclotomic fields. In my sixth lecture, I have already briefly 
mentioned his findings about the irregular primes less than 164. As a matter 
of fact, not only did he determine whether a given prime p would or would 
not divide some Bernoulli number B,, (2 < 2s < p — 3), but he actually 
determined the first factor of the class number, its factorization into primes 
(for all p < 164). All this was done without the help of any machine. And 
amazingly, a recent check by Newmann (who extended Kummer’s table) 
in 1970, uncovered only three mistakes: p = 103, 139, 163. 

I think that it is instructive to reproduce Kummer’s table (as amended by 
Newman), to give a better feel of the growth of the first factor. 


Prime p h*(p) into prime factors 





3 1 

5 1 

7 1 

11 1 

13 1 

17 1 

19 1 
23 3 
29 2x2x2 
31 3x3 
37 37 
41 11 x 11 
43 211 
47 5 x 139 
53 4889 
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Prime p h*(p) into prime factors 





59 3 x 59 x 233 

61 41 x 1861 

67 67 x 12739 

71 7 x 7 x 79241 

73 89 x 134353 

79 5 x 53 x 377911 

83 3 x 279405653 

89 113 x 118401449 

97 577 x 3457 x 206209 

101 5x5x5x5x5x 101 x 601 x 18701 
103 5 x 103 x 1021 x 17247691 

107 3 x 743 x 9859 x 2886593 

109 17 x 1009 x 9431866153 

113 2x2x2x 17 x 11853470598257 

127 5 x 13 x 43 x 547 x 883 x 3079 x 626599 
131 3x3x3x5x5x 53x 131 x 1301 x 4673706701 
137 17 x 17 x 47737 x 46890540621121 


139 3 x 3 x 47 x 47x 277 x 277 x 967 x 1188961909 
149 3 x 3 x 149 x 512966338320040805461 
151 7x 11x 11x 281 x 25951 x 1207501 x 312885301 


157 5 x 13 x 13 x 157 x 157 x 1093 x 1873 x 418861 x 3148601 
163 2x 2x 181 x 23167 x 365473 x 441845817162679 


Other recent tables of h*(p) and their factors were compiled by Schnuttka 
von Rechtenstamm (up to 257) and by Lehmer and Masley (up to 521). 
I mentioned in Lecture VI the formula for the first factor of the class 


number, namely 


1 eae PT 6.1 
h*(p) = Qpe 27 ott) Gh CH. (6.1) 


where 7 is a primitive (p — 1)th root of 1, G(X) = = ê g;X’, gisa EE 
root modulo p, 1 < g; < p — 1, and g; = g’ (mod p). Kummer has use k is 
formula for his computations. However, for larger values of p, it is rather 
inefficient. Lehmer and Masley based the computations on the evaluation 
of the Maillet determinant. 

If p is an odd prime, and r is not a multiple of p, let R(r) denote the least 
positive residue of r modulo p. For every integer r, 1 < r < p — 1, let die 
the unique integer such that 1 < r < p—1andrr=1 (mod p). The Maille 
matrix for p is 

Mp = Urs... p- 1912 (6.2) 


Its determinant will be denoted by D,. 
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The following formula was discovered by Weil, who did not publish it. 
Carlitz and Olson discovered it independently (1955): 


1 
h*(p) = dean DI (6.3) 


With this formula, h*(p) was evaluated by Masley, in the range indicated. 
Concerning the growth of h*(p), Kummer stated a very interesting con- 
jecture which is as yet unproved: 
(p+ 3)/4 


H 
h*(p) ~ y(p) = P= 32 (7-1/2 (6.4) 





(asymptotically as p tends to infinity). This conjecture has stimulated a 
number of deep recent investigations. I would like to mention some of the 
outstanding results obtained thus far, in this vein. 

In 1951, Ankeny and Chowla proved 





h* 
log Wé 
tee (6.5) 
p>% logp 


or equivalently 
log h*(p) = Ap + 3)log p — $plog 2x + o(log p). 
In 1964, Siegel proved a weaker result: log h*(p) = logy(p) + o(p log log p). 
It follows that 
log h*(p) ~ i log p. 


Computations by Pajunen in 1976 showed that for 5 < p < 641: 
2 h* 3 
<P 3 


A ei 3 
At the present, the best result is due to Montgomery and Masley (1976): 
{f p > 200, then 


Ga P? pP 254 < h*(p) < (2n)"P/2pir+ 31/4 (6.6) 





Despite the growth of h*(p), it is nevertheless bounded by some quite nice 
functions. In Lepistö (1974) and Metsänkylä (1974) it is proved that 


p (p-1)/4 
h*(p) < 2p (2) (6.7) 


In 1961, Carlitz gave the following other upper bound (which is ultimately 
weaker): 





D 7 ei when p = 1 (mod 4), 


h*(p) < (6.8) 


DG E when p = 3 (mod 4), p > 3. 
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Ankeny and Chowla also proved in 1951 that there exists pọ such that 
h*(p) is strictly increasing with p, for p > po. It has been conjectured by 
Lepistö that pọ = 19. The latest computations of Lehmer and Masley con- 
firm this conjecture up to 521. 

A fact that was long suspected and has now been established, is the 
following theorem, obtained independently by Uchida and Montgomery in 
1971. 


If the class number h(p) of Q(¢,) is 1, then p < 19. 


Even better, if h*(p) = 1, then h*(p) = 1, so that h*(p) = 1 already implies 
p<19. 

I should say also that in his thesis (1972), Masley determined all the 
cyclotomic fields Q(¢,,) (where m #2 (mod 4), without loss of generality) 
having class number 1 (see Masley and Montgomery, 1976). There are 29 
distinct such fields, namely, for m = 3, 4, 5, 7, 8, 9, 11, 12, 13, 15, 16, 17, 19, 
20, 21, 24, 25, 27, 28, 32, 33, 35, 36, 40, 44, 45, 48, 60, 84. 

The little that is known about the first factor of the class number of 
cyclotomic fields seems immense when compared to what has been estab- 
lished about the second factor. 

It is again Kummer who proved, as I have stated in my sixth lecture: 
An odd prime p divides h(p) = h*(p)h*(p) if and only if p divides h*(p). In 
other words, if p|h*(p), then p|h*(p). 

As for the parity of the class-number factors, Kummer proved in 1870 
that if h*(p) is odd, then so is h*(p). 

In his computations up to 163, only h*(29), h*(113), h*(163) are even. 
However, he proved that h*(29) = 1, h*(113) is also odd, while h*(163) is 
even. He also established that h*(257) is divisible by 3, and that h* (937) is 
even. 

A method to produce cyclotomic fields with second factor greater than 1 
was invented- by Ankeny, Chowla, and Hasse in 1965. It was based on a 
lemma by Davenport and uses class field theory. This is the lemma: 


If l, m are positive integers, if m is not a square, and if the equation 
X?—(? +1)¥? = +m 


has solution in integers, then m > 21. 


Next, it is shown that the class number H(p) of the quadratic field Q(,/p), 
where p = (2qn)? + 1, q a prime, n > 1, is greater than 1. 

A comparison by means of class field theory of the field Q(,/p) and the 
real cyclotomic field Q(¢, + £, +) yields the following result: 

If q is a prime, n > 1, and if p = (2qn)? + 1 is also a prime, then h*(p) > 2. 
In particular, for p = 257, 401, 577, 1297, 1601, 2917, 3137, 4357, 7057, 8101, 
h*(p) > 2. 
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A variant of the above result was presented by S.-D. Lang in 1977. He 
proved that if q is a prime, n > 1, and if p = [(2n + 1)q]? + 4 is also a prime, 
then h*(p) > 2. This gives h*(229) > 2 and many more examples. 

Does this method provide infinitely many examples? This question is at 
least of the level of difficulty of the problem of Sophie Germain on the 
problem of twin primes. 
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LECTURE VII 


After Kummer, a New Light 


TU report on the work of Mirimanoff, inspired by the last of Kummer’s 
papers. With his great ability, he refined Kummer’s treatment for the first 
case and obtained new congruences. On the other hand, due to the difficulty 
in achieving these comparatively meager improvements, it was obvious that 
no further progress would be forthcoming along these lines. 

Then in 1909 came Wieferich. He discovered a criterion for the first case 
of an entirely different nature. 

His first proof was an enigma. Few people were able to understand how 
Wieferich succeeded, like a magician, in unravelling from very complicated 
formulas, so simple and beautiful a criterion as: 


If the first case fails for the exponent p, then p must satisfy the congruence 
2?-1 = 1 (mod p°). 


Note that this condition is the first, in the history of Fermat’s problem, 
which does not involve the would-be solutions x, y, z of X? HP + Z° = 0. 
Just the exponent, and as I'll show, it is a very stringent condition indeed. 

With every breakthrough, come the followers. In this case a distinguished 
group including Mirimanoff, Frobenius, Vandiver, Pollaczek, and 
Morishima. 


1. The Congruences of Mirimanoff 


In 1905, Mirimanoff transformed Kummer’s congruences. I recall that 
Kummer showed: 


If x, y, z are relatively prime integers, not multiples of p, such that 
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xP + yP + 2? = Q, then 
[D?~ *5log(x + e’y)]B,, = 0 (mod p) (1.1) 
for 2s = 2,4,..., p — 3. Similar congruences hold by permuting x, y, z. 


Here D"f(v) denotes the nth derivative of f(v) computed at v = 0. 
As I pointed out, 


Riy) 
(x+y 





Di log(x + ey) = 


where R,(X,Y) is a homogeneous polynomial with coefficients in Z and total 
degree j. 

I recall the properties of these polynomials. Putting Y = XT, then 
RiX,Y) = XPT), where P,(T) has coefficients in Z and degree j — 1. For 
j>1 and odd, T(1 — T) divides PT), so P(T)= T(1 - T)LT), and 
L,T) has degree j — 3. 

The congruences of Kummer become 


P,- as(t)B2, =0 (mod p), (1.2) 
where 2s = 2, 4,..., p — 3 and 


Since x? + y? +z? = 0, x + y + z = 0 (mod p). 
If t = x/y, then the elements of G are congruent modulo p, to those of the 


set 
1 1 t 1 
= —,;, =, — t =Z Vene 1 ~ . 1.3 


In some special cases, H degenerates: Ift = 1, or —2,or — 4 (mod p), then 
H = {1,—2,—4}. If £? +t+1=0 (modp), then p=1 (mod6) and H has 
only 2 distinct elements, In all other cases, H has 6 distinct elements. 

I note also that none of the elements in G may be congruent to 0 or to — 1 
modulo p. 

Mirimanoff’s first result extended an earlier criterion of Kummer: 


(1A) If x? + y? + z? = 0 and py xyz, then B,-, and B,-s are multiples 
of p. 


SKETCH OF THE Proor. Since [D7 log(x + e’y)]B,-, = 0 (mod p) (and sim- 
ilarly, for any permutation of {x,y,z}), it suffices to show that D’ log(x + ey) 
(or that any of the derivatives obtained by such a permutation) is not con- 
gruent to 0 modulo p. 

Assume that this is false, so P-(t) = 0 (mod p) for every t e G. As I men- 
tioned in my last lecture, P(T) = T(1 — T)L+(T), where L-(T) has co- 
efficients in Z and degree 4. 
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If the elements of G are pairwise incongruent, they cannot all be roots of 
the congruence P,(T) = 0 (mod p) (note that the elements of G are not 
congruent to 0 (mod p)). 

If the elements of G are congruent to those of {1,—2,—4}, then L,(— 2) = 0 
(mod p). 

But 

LAT) = 1 — 56T + 246T? — 56T? + T*. 


Hence p divides 7 x 223, so p = 223 (since 7 is excluded). 
Also, from the naive approach mentioned in Lecture IV, 


(t+ 1)? =£ + 1 (mod p’) 
for any te G. 
Hence, with t = 1 (mod p), it follows that 


2? = 2 (mod p°), 


where p = 223. This is not true, however: the order of 2 modulo 223 divides 
222 = 2 x 3 x 37, so it may be 2, 3, 6, 37, 74, 111, or 222. All these cases do 
not lead to 2??? = 1 (mod 2232). 

There remains the case where t? + t + 1 = 0 (mod p). Then p = 1 (mod 6) 
and also t # 1 (mod p), hence t is a root of L,(T) = 0 (mod p). But 

LAT) =(T? + T + 1XT? — 57T + 302) + (—301T — 301) 

hence — 301t — 301 = 0 (mod p) and p divides 7 x 43 since t # —1 (mod p). 
Thus p = 43. But, 4p + 1 = 173 is also a prime. Hence from the Legendre 
and Sophie Germain criterion, the first case holds for p = 43, and p = 43 
is also excluded. 

A similar proof may be repeated for the 9th derivatives D? log(x + ey) 
and those obtained by permutation of x, y, z. It still works, but requires a 
more careful analysis. m 


This method cannot be pushed further, without great pain since it leads 
to polynomials L,(T) of degree greater than 6, when j > 11. 

However, Morishima extended this result in 1932 to guarantee that if the 
first case fails for the exponent p, then B,- 1; and B,- 13 must also be divisible 

y P. 

Later I shall describe a very powerful theorem of Krasner, along the same 
lines, but obtained through a totally different method. 

To derive Mirimanoff’s congruences, it is necessary to study in more 
detail the polynomials P,(T). The first property to note is 


D 1 
REIH forj>1. (1.4) 
So, writing 


PT) = T= paT? + + te T+ 
+ (- 1a; j- TiL, (1.5) 
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it follows that 


aa = Gj, j-k for 1 <jisk<j-1. (1.6) 
The computation of the coefficients o, (1 < k <j — 1) is done recursively. 


The following lemma, due to Euler, is needed: 


Lemma 1.1. Let t # 1 be any real number, let g(v) = 1/(1 + et). Then the 
Taylor development of g around v = Q is 


2 n 
glo) = Colt) + Cilo + 0, ++ GOTH 


where 
1 
er 
t 
Ca 
di (1 +t)? 
t — t? 
Ch == 
alt) (1 +1? 
Cl) Dal e eg E 
cz (ty 
and 


s n+1 $ n+1 Lied n+1 CHEN 
ELE i Ja-ır+( 2 Ja 2) ( 3 ya 3)" + 


_,fntl 
+(-1) aes 


forh=1,2,...,n. 
With this lemma, the following may be shown, without much difficulty: 


The coefficients of P(T) are given by the formula: 
ie eet e iy- s ee a ST 

aj, =k C) (k= 17} + (3) (k - 27 "+ +(-]1) 6 Si 
fori<;,1<k<j-1. (1.7) 


So the coefficients a,, are the sums of the first k summands, computed 
for X = k, of 


GN" (X) = X77! — MÉ — 17t + BE Be as 


+(- DAX -j+ I. (1.8) 
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Another fact about these polynomials is that GN" DO) = 0. This is not 
difficult to show; the easiest proof makes use of the theory of finite differences. 
A somewhat involved algebraic manipulation leads to the following: 


Ift<j<p-— 1, then 
(1+ TPFP(T)J=T- FIT? + 31T? —(p— IN LTE 1 (mod p). 
(1.9) 
Rewriting this as 
(1— 7)? JP(—T)=T +2 71T? + 34-173 +--+ +(p- 1 T?!(modp), 
(1.10) 
the Mirimanoff polynomials are brought to light: 
Y9(T)=T+T?’+T?+ + TE, 
oT) = T +2T? +373 + +(p- DT}, 


o{T) HUTT 4 MIT Hp Te, (110) 


Ee + 27T? DEE E (p PTT, 
P(T) = PIT + 27-1 T? + 3P IT? 4 H (p PETE, 
So (1.10) can be rewritten as 
oT) = -(1 — T)?-§P,(—T) (mod p) (1.12) 


forj=2,...,p—1. 
Due to their nice form, the Mirimanoff polynomials display many in- 
teresting properties. First, some obvious facts: 


Et — T”-!) 

T) = — Jl 
p(T) ILT ° (1.13) 
(T) = ,(T) (mod p), (1.14) 
9,(T) = —P,(—T) (mod p). (1.15) 

Let T be the following operator on polynomials of Q[T]: 
dG 
= T —. ; 
T(G) IT (1.16) 
Let T* be the kth iterated operator of! (for k > 1). Then: 

Tod zéien (forj=1,...,p—1), (1.17) 
I'(@p-1) = P, (mod p). (1.18) 


Therefore 
Ir (9) =@;(modp) forj=1,2,...,p—1. 
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Mirimanoff also considered the polynomials 
W(T)=@9,1—T) forj=1,2,...,p. (1.19) 


y(T) has degree p — 1 and is a multiple of 1 — T. Moreover: T?./ divides 
WT) modulo p (for j = 2,..., p — 1) and W,(T) = T”™! — 1 (mod p). 
Let A be the following operator on polynomials of Q[T]: 


dG 
A(G) =(T — DFF (1.20) 
It follows at once that 
AY) =Y; (forj=1,...,p — 2), (1.21) 
A(Yp-1) = Y(T) (mod p). (1.22) 


Further relations between the polynomials 9,7) and y;(T) are the 
following: 


Tell Te 





®%-ıT)=%,-ı(T) = (mod p). (1.23) 
Taking T = —1, then 


prl¢_yjyimt pol . 2» 2 
3 iz È (GIN Ip ~ ®p-1(— 1) = —— (mod p). (1.24) 


j=l J j=l 





The explicit computation of Y,- (T) gives: 
T? T? 1 T* 1 1 
—Wp-(T) = SC 3 (1+3)+ SIE 5) 
Te! 1 1 
++ Er 


pi 5 d (mod p). (1.25) 








This leads, with some further work, to the congruence 


Op-AT) = Thy IER Vp-2(T) (mod p). (1.26) 


At this point, Mirimanoff introduced logarithms. He made the happy 
observation, which may be established by induction on j: Ifj = 1,2,..., p — 2, 
then 


[log(1 — T)P = (- D’j!y,- (T) (mod T”, p), (1.27) 


where the congruence means that the coefficients of T* (for k < p — 1) in 
both sides are congruent modulo p. This served as starting point to prove 
the following congruences: 


1 
Lo, DP = -2| vp aT + EH (mod p). (1.28) 
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As a corollary 
1 T 
le, (N? = @p-2(T)+ (T — reen (mod p). (1.29) 


With this groundwork, Mirimanoff was ready to prove: 


(1B) If x, y, z are pairwise relatively prime integers, not divisible by the 
prime p > 2, and such that x? + y? + 2? = 0, and if —t € G = {x/y, y/x, x/z, 
2/x, y/z,z/y}, then the following congruences are satisfied: 

Pp- vu) = 0 (mod p), 
Pp-2(t)P2(t) = 0 (mod p), 
Pp-3(t)P3(t) = 0 (mod p), 


Pip + 1 u 2 1)2(t) = 0 (mod p). 


PROOF. Despite all the preparation, the proof is still long. So, Tl sketch its 
main points, trying to bring the main idea into view. To begin, the Kummer 
congruences 

Pp-2s(~ t)B>, =0 (mod p) 


are satisfied for 2s = 2, 4,..., p — 3. 
Since o, z(t) = —(1 — 1)?" P,_,,(-1) (mod p), 


Pp-2s(t)B2s =0 (mod p). 
, p — 2 yields 
o, ; = 0 (mod p). (1.30) 
0 (mod p). For this, use (1.29): 


Setting j = 3,5,... 


(a) op- Gë 
SICH i(t)]? eo, 201 + (t - 1)??e,- dei: : ;) (moan) 


Next, @,-2(t) = 0 (mod p) and ul — t) € G so 9,-2(-t/(1 — t)) = 0 (mod p). 
This establishes (a). 
(b) For j = 1,2,..., p — 2, 


Pe Ges) = T + Eed +27? + 372 (E 2) + 34) TP 
+++ (p = IS 2-414 + KI + 
+ (p — INTE! (mod p). 
Start with the defining expression for p,; ,(T), which is rewritten as 
PT) = (1 — DIT + (1) + YT? + (1) + 24 30 
SE CHE EE TTC ERT Mk petit 
+ (V+ 2) +-+++(p—1))T?. (1.31) 
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But 
G+ DE ++ +(p- 1] 


ele Ae 


(as seen in Lecture VI). Since j < p — 2, by von Staudt and Clausen’s theorem 
p does not divide the denominator of each B; (1 < i < j), so the above sum is 
congruent to 0 modulo p and (1.31) becomes the congruence 


@j4+(T) = (1 — DIT + (15 + 2/7? 
toot (14+ 2) ++ (p 1)T?'](modp). (1.32) 


Applying T to oe, (TM — T) and iterating this process leads to the 
congruence (b). 





(c) Forj=1,2,...,p — 2, 
lege. T) 1 1 i\ B 
pe-2 (Gad Piel) Jee (T)+< 5 Pr- (T) + (1) Zoan 
i\ B 
+) Gp—a(T) + BET +++ (modp). 
(1.33) 
To arrive at this congruence it is enough to recall that 
j pj wt! n (j\ B2 
Y+2+ d = Ji 
S RE tet) Se 


J B4 j-3 J Be et bets 
dE dk + 
(d) Forj=1,...,p—2, 


I? o; (T -= T) le P,-ı(T) + r” a EIER P,(T)] (mod p). 
(1.34) 
Indeed, by repeated application of T: 


re? Lo; (TU - NP] = pozaz S SEN ze 


= PPI (GPa - T)? 


= TP? pj ı(T)) 
+ F?~?L9;4(T)e(T)] (modp) (1.35) 


because rtd — T}? = 0(mod p)and (1 — T)? ~! = 1 + @,(T) (mod p). From 
(1.17), IP”? %@,+ ,(T)) = 9,- ı(T) (mod p), showing that (d) is true. 
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(e) Conclusion: 


—2-j 
r?-2 o; (T)PT)) = Pp- 1(T)p,(T) + (? d P»-T)ox(T) 


+ ( E Ke 9,-T)03(T) 


+ 0j43(T)9,-1-(T). (1.36) 


Let —teG, and compute the residue of [?~?~(¢,4,(0/(1 — 0). Since 
t £ 1 (mod p), by (1.13) @,(t) = 0 (mod p). By (a), (1.30) and (1.33): 


Kë Kerg = 0 (mod p). (1.37) 
So, by (1.35) and (1.37) 
r?-?- To, (O01 — 071] = 0 (mod p). (1.38) 
On the other hand, by (1.34), (a), and (1.36), 


re-?-Fo,, (1 - t] = Se a Nos- Jet 
E 4 E =) o, drett 


feet 4 u i P) oror- 


+ orl 1 Al, (1.39) 


By (1.38) the last sum is congruent to 0 modulo p, for j = 1, 2,..., p — 2. 
Letting j = p — 3, p — 4, . . . , successively, gives 


Pp- 2(t)92(t) = 0 (mod p), 
Pp- 3(t)P3(t) = 0 (mod P), 


Pp + 1 u Sr 1)/2(t) = O (mod p). LI 


In 1967, LeLidec gave another form to Mirimanoff congruences, using 
other polynomials. I use the following notation: If p 4 s, let 5 be the unique 
integer such that s = (mod p) and 1 <3 < p — 1. Let s’ be any integer such 
that s's = 1 (mod p). For each n= 1, 2,..., p — 2, let E, be the set of all 
integers s, 1 < s < p — 2, such that (n + Ins < s. 
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LeLidec considered the following polynomials: 


A(T) = YL ST (1.40) 


seE, 


The basic relationship with Mirimanoff polynomials is the following: 


(1C) If x #0, 1 (mod p), then x is a common zero of the congruences 
Pp-1(T) = 0 (mod p) 


A(T) = 0 (mod p) 1.) 


A,-2(T) = 0 (mod p) 
if and only if it is a common zero of the congruences 
Pp-s(T) = 0 (mod p) 
Pp-2(T )p2(T) = 0 (mod p) 


Ppt 1y2T)P~p- 1y2(T) = 0 (mod p). 


From this result, LeLidec proved: 


(1D) If x, y, z are pairwise relatively prime integers, not multiples of the 
odd prime p, if x’ + yP? + 2? =0, and if —te G = {x/y, y/x, x/z, z/x, y/z, z/y}, 
then t is a root of the congruences (1.41). 


2. The Theorem of Krasner 


As I have said, the theorems of Cauchy and Genocchi, of Kummer, Miri- 
manoff, and Morishima established that if the first case fails for the exponent 
p, then the Bernoulli numbers B,_3, B,-5, B,-7, B,-9, B,-11, B,- ı3 are 
congruent to 0 modulo p. 

The idea behind these theorems is that this is an unlikely event. At any 
rate it can be checked in a finite number of steps. 

So it is desirable to obtain stronger restrictions, say, that a longer sequence 
of successive Bernoulli numbers be congruent to 0 modulo p. 

In 1934, Krasner proved a most striking result along this line. Yet, in its 
formulation there is a condition which makes the theorem unfit for any 
practical application: it holds for primes p larger than no = (45!)?? = 
7.0379 x 104934, 

Krasner’s theorem again relies on Kummer’s congruences. So it is neces- 
sary to investigate carefully the quantities D’log(x + e’y). 

I begin with an expression already known since Herschel (1816). An 
analogous formula is found in Hilbert’s Zahlbericht in §132. Let AO denote 
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the first term of the sequence of kth iterated differences obtained from 


101,2 Ch 
As easily seen 
DS fk), 
A0 =F (— pal d (2.1) 
j=0 J 


Herschel’s lemma is the following: 


Lemma 2.1. If x, y are real numbers, x # —y, if t > 0 is a real variable 
and t = e, then for everyi 2z 1 


d' log(x + ey) L A*0! dk log(x + ty) 
- = 2.2 
| dv’ L x k! dt* t=1 ( ) 








Putting 6 = y/(x + y) yields 
k 
E log(x + 2 — (- 1% = 1)!0* 
dt 1=1 
so (2.2) may be rewritten as 
i D i kai 
er logie te 2] EEE ee 
dr v=0 k=1 k 
Let T be an indeterminate and consider the polynomials 
i kg 
MiT)= (N 
k=1 
Among the relevant properties of these polynomials, I mention: 
MOT) mim T, 
and M;,,(T) = T(1 — T)M{T) for i > 1. The leading coefficient of M,(T) 
is (— 100 — 1)! and the coefficient of T is — 1. The roots of M;(T) (for i > 2) 
are real, simple, and belong to the closed unit interval [0,1]. 
Let N(T)= M(T)/T(i — T) = M;_,(T) and P(T) = T'M,(1/T)/1 — T) 


(for i > 2). These polynomials have degree i — 2, no roots in common. Their 
resultant R; is not zero and satisfies the inequality 


|R| < [di - D!P?. (2.4) 


With these preparations, Krasner proved: 


TH (2.3) 


(2A) Let no = (45!)8°. If p is a prime, p > no, if 


k(p) = [Vlog p], 


and if the first case of Fermat's theorem fails for the exponent p, then the k(p) 
Bernoulli numbers B,- 1-2: [ for i = 1, . . . ,k(p)] are congruent to 0 modulo p. 
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Proor. I'll omit the details of the proof and only give the main idea. Consider 
the sequence of polynomials M,(T), M,(T),.... Let jọ > 0 be the smallest 
integer such that j > jp implies 


[G = IPOT? < e0, (2.5) 


It may be seen that jo = 46. 

For every j = 1, 2,..., jo, let R; be the resultant of N (T) and P,(T). By 
(2.4) max|R | < (451° = no. 

If p > n and there exist x, y, z, pairwise relatively prime integers, not 
multiples of p, such that x? + y” + z?” = 0, it must be shown that B,.; =0 
(mod p) for i = 3, 5,..., 2k(p) + 1. Assume the contrary for some odd index 
j,3 <j < 2k(p) + 1. By Kummere congruence, 


d’log(x + e’y) = 
( ES Lies 0 (mod p). 
By (2.3) MAI =0(mod p), where 0 = y/(x + y). Since 6#0, 1 (mod p), 
N (0) = 0 (mod p). Again, by Kummer’s criterion 


d’log(x + ez) = 
( ES 1. = 0 (mod p) 


and similarly M,(1/0) = 0 (mod p). Therefore P,(0) = 0 (mod p), so R; = 0 
(mod p). But R; # 0, hence p < |Rjj. 
If j < jo, then |R,| < no < p. If jo <j, then 


IR;| < [V = 1)1]?4°2 < el < ek’ < goer — p. 


In both cases, there is a contradiction. 


Because of his various results, Vandiver in his papers of 1946, 1953 stated 
his belief that the first case is true. Krasner wrote in 1953 to Vandiver about 
this: 


Concerning your discussion of the truth of Fermat’s theorem, I think as 
you do, that in Case I it is certainly true. Your argument is in order. Even the 
numbers p not satisfying my criterion given in Theorem VIII of your article 
must be very exceptional. But I think that my preceding result given in the 
Comptes Rendus of 1934 furnishes even much stronger arguments in this sense. 
It seems quite unlikely that all the [3/log pl consecutive Bernoullian numbers 
Bp~1-2 1 < i < [log p], are divisible by p, if p is not too small. If you admit 
that the probability for an unknown Bernoullian number to be divisible by 
p is 1/p such a divisibility has only the probability 


1\ iog p] a 
p leen 


and a simple calculation shows that the mathematical expectation of the 
number of primes p > n satisfying this condition does not exceed 2/nY!®”. 
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When we use a little different and very likely hypothesis that, if f, is the 
frequency of the Bernoullian numbers divisible by p, 


ee Pp = 1 
n(x) 


H 


the result is not very different, with only maybe, a greater dispersion. 


3. The Theorems of Wieferich and Mirimanoff 


As I mentioned already, Wieferich’s proof of his famous theorem is quite 
difficult. Mirimanoff gave a simpler proof in 1909, soon after the original. 
He made use of his polynomials. His manipulations required a formula due 
to Euler, for the alternate sum of odd powers of consecutive integers: 


1 2k+1\2 -1 
j-1:2k+1 rr (wtl JI y2k+1 


2k+1\2*-1 B 
d ) 2 Byn2*-2 





2k 
Bn 


ıM = 


jel 





(kri gn 
i 2k-1} 2k 2 
22k+2 = 22k+2 =f 


Za Pen ee GT EH 


With this formula, Mirimanoff proved (p > 3 and T is an indeterminate): 
lier? — 2P-2 4 Bp-2 ee (p Se Ip "li 


P-N2-1 
= HT reueg 0 1 ) 2 


— _ = es et 
+ i "E Bio ie T) ++ (? sant: d 





Bam BEEN 


3 4 p-4) p—3 


2P 2 TESI 
= B 


; (3.2) 
p-1 ”! T+1 





I restate the theorem of Wieferich (1909) and give Mirimanoff’s proof: 


(3A) If the first case of Fermat’s last theorem fails for the exponent p, then 
2?-! = 1 (mod p°). 


PROOF. Assume there exist integers x, y, z, not multiples of p, such that 
xP + yP + 2? = 0. Let G = {x/y, y/x, x/z, z/x, y/z, z/y}. By Mirimanoff 
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congruences, if t e G, then 
Pp-2s(—1)B2, = 0 (mod p) 
for s = 1, 2,...,(p — 3)/2, and also 
P,-1(-9 = 0 (mod p). 


From the theorem of von Staudt and Clausen, it follows that (2? — 2)B 
is p-integral. Hence, by (3.2), 


p-1 


1772 — 2772 4 3772 — ..._ (p — 1)P2 = 0 (mod p). (3.3) 
By (1.24), 
-2 -2 -2 22-2 
1772 — 222 4 3Pp-2 _..._(p- DÉI = S (mod p). 
Hence 2?~! = 1 (mod p°). o 


The quantity q,(2) = (2?~ ' — 1)/pis an integer, by Fermat’s little theorem. 
It is called the Fermat quotient of p, with base 2. To say that 281 = 1 (mod p°) 
is equivalent to saying q,(2) = 0 (mod p). 

From the time of its discovery, it was quite apparent that Wieferich’s 
theorem represented a noteworthy advance over all previous results. 

The search for primes p satisfying the condition began immediately. It 
was not until 1913 that Meissner discovered after long calculations the first 
example: 

21092 = 1 (mod 10937). 


A second example was later encountered by Beeger (1922): 
23510 = 1 (mod 35117). 


These congruences may also be proved in a rather short, but perhaps 
artificial way (see Landau, 1927, Guy 1967). 

As I said in my first lecture, no other example exists with p < 3 x 10°. 
Calculations have been performed by various researchers recently with 
computers, and brought to the above limit by Brillhart, Tonascia, and Wein- 
berger in 1971. So, for every prime p # 1093, 3511, p < 3 x 10°, the theorem 
of Wieferich and the computations above, guarantee that the first case holds 
for p. 

Having understood the reasons behind Wieferich’s theorem, Mirimanoff 
proved an analogous criterion, this time for the base 3: 


(3B) If the first case of Fermat’s last theorem fails for the exponent p, then 


3?-1 = 1 (mod p°). 


In other notation, the Fermat quotient with base 3, q,(3) = (3? * — 1)/p = 
0 (mod p). 
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The proof of this theorem is substantially more difficult. Let it be only 
said that it makes much use of properties of the Mirimanoff polynomials 
and also of logarithms. 

A computation showed that for p = 1093 and 3511 the Fermat quotient 
q,(3) # 0 (mod p). This guarantees that the first case also holds for these two 
exponents, which were not covered by Wieferich’s theorem. 

In 1910, Frobenius gave a proof of the theorems of Wieferich and Miri- 
manoff. His proof was algebraic, without the use of Kummer’s congruences. 

Soon after, in 1912, Furtwängler proved a very general theorem, using 
class field theory, or more precisely, Eisenstein’s reciprocity law. As a 
corollary, he derived both theorems of Wieferich and Mirimanoff. These 
theorems will be considered in my next lecture. 

In the literature there is a paper by Linkovski (1968), in which he claims: 
If the first case of Fermat’s theorems fails for p, then 2?~! = 1 (mod p°). 
This would represent an outstanding strengthening of Wieferich’s theorem. 
However, Linkovski’s proof is not correct, since it is based on the following 
assertion published by Grebeniuk in 1956: 

If x, y, z are integers, not multiples of p, such that x? + y? + z? = 0, if 
l divides x + y + z, and gcd(l,xyz) = 1, then l divides 2?~1 — 1. In 1975, 
Gandhi and Stuff analyzed Grebeniuk’s proof and found a mistaken de- 
duction, so the statements of Grebeniuk and Linkovski are now questionable. 

Of course, this is only one of so many mistakes made about Fermat’s 
theorem, by outstanding mathematicians (like Kummer himself), as well as 
by good and not-so-good mathematicians. 

I’m only giving these facts, in order to tell a story. Terjanian doubted the 
veracity of the congruence obtained by Linkovski. He wrote to Peschl, since 
that congruence is quoted in Klösgen’s monograph—but not used in it—and 
Klösgen was Peschl’s student. Peschl asked Hasse, during his visit to Bonn, 
for the Colloquium in Krull’s memory. Hasse then wrote to Dr. Kotov, of 
Minsk, with whom he exchanges stamps, asking to translate Linkovski’s 
paper. In reply, Kotov wrote: 


In the paper of Grebeniuk, which is quoted by Linkovski, there is a mistake. 
This was established by my colleague V. I. Bernik (also a student of SprindZuk). 
The following happened. A mathematician Jepimaschko of Vitebsk produced 
in 1970 a proof of Fermat’s last theorem. This proof was based on the paper 
of Grebeniuk, quoted by Linkovski. Bernik and I (Kotov) have checked all 
the arguments of Jepimaschko without finding any mistake. Later, Bernik 
checked the paper of Grebeniuk, on which Jepimaschko was based, and there 
he found the mistake. We knew the work of Linkovski, but we have not called 
his attention to the mistake of Grebeniuk. We had already had so much 
trouble to understand Jepimaschko’s that we didn’t wish to enter again into 
unhealthy discussions about Fermat’s problem. 


Despite the best of intentions, and the cleverest of our manipulations, 
beware of a proof of Fermat’s theorem. A mistake may be carefully hidden. 
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4. Fermat’s Theorem and the Mersenne Primes 


There are various nice consequences of the theorems of Wieferich and 
Mirimanoff. I show how it is possible to deduce that the first case of Fermat’s 
theorem holds for a class of primes which includes the Mersenne primes. 

I begin with this easy lemma: 


Lemma 4.1. Let p be an odd prime, k not a multiple of p. Then kp cannot 
be written in the form 
kp=+tm+n, 


where mP"! = 1 (mod p?) and n?~! = 1 (mod p°). 
Mirimanoff(1910), Landau (1913), and Vandiver (1914) proved repeatedly: 


(4A) If p = 273° +1 or p= +27 + 3°, where a> 0, b > 0, then the first 
case of Fermat’s last theorem holds for the exponent p. 


Proor. If the theorem fails for p, then 2?~' =1 (mod p?) and 37 "!=1 
(mod p?). By the lemma (with k = 1), p cannot be ofthe form indicated. oO 


In particular, if p = 2° + Lisa prime, the first case holds for p. 

It is quite easy to see that if p = 2° + 1 is a prime, then a itself is a power 
of 2, and so p = 22” + 1. The number F, = 2?” + 1 is called a Fermat number. 
The only known prime Fermat numbers are Fo, F,, F2, F3, F4. Euler dis- 
covered that F, is a multiple of 641. It has been conjectured that there are 
only finitely many prime Fermat numbers, and perhaps only the ones 
already known. This is a very difficult question. 

On the other hand, if p = 2° — 1 is a prime, then a is necessarily equal to 
a prime, a = q. Then p = 24 — 1 = M, is a Mersenne number. At the present 
there are 24 known Mersenne numbers which are primes. The largest ones 
are M19937, with 6002 digits, discovered by Tuckermann in 1971, M21701 
with 6533 digits, discovered by Nickel and Noll in 1978 and M23209, M4497, 
discovered by Nelson and Slavinsky.’ 


In view of (4A) and the likelihood that at any given moment the largest 
known prime will be a Mersenne prime, I may state: 


The first case of Fermat’s last theorem holds for the largest prime known 
today and this is likely to be true at any future time! 


Let me add that Schinzel has conjectured that there exist infinitely many 
square-free Mersenne numbers. Rotkiewicz proved in 1965 that if this 
conjecture is true, then there exist an infinite number of primes p such that 
2?-1 Æ | (mod p’). And therefore, by Wieferich’s theorem, there are an 
infinite number of primes p for which the first case of Fermat’s theorem holds. 


1 See Footnote 1 of Lecture II. 
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5. Summation Criteria 


The earliest criterion for the first case involving summations was discovered 
by Cauchy in 1847: 
If the first case fails for p, then 
(p= 1/2 
yj? * = 0 (mod p). (5.1) 
j=1 

In view of the expression of summations, like the above, in terms of 
Bernoulli numbers, it was not unexpected to arrive at criteria for the first 
case involving summations. This idea was exploited by E. Lehmer and 
Vandiver, among others. Quite recently, I have also contributed some new 
criteria of the same kind. 

The first group of quotients involves sums of powers p — 2 of terms in 
arithmetic progression, Fermat quotients q,(2), q,(3) and the Wilson quotient 
W(p), which I now define. 

Wilson’s theorem says that 

(p — 1)! = —1 (mod p) 
so 
(p—-—1)!+1 


WwW a a 
(p) S (5.2) 


is an integer, called Wilson quotient of p. 
In 1938, Emma Lehmer proved the following congruences: 


(p—1)/2 











Z (P= 2) = ADE — pW] mod p’), (5.3) 
[p/3] = 
Zn - 07 am Eege when p > 3, (5.4 
[p/4] = 2 
d (5.5) 
[p/6] boss 4q.(2 34 (3 8 
E (p — gie? _ [44,(2) + 34,(3)][1 DW] + 2q,(2)4,(3) BER 
when p > 5, (5.6) 
(p—1)/2 
2 pts —2q,(2)[1 — pW(p)] + 2p[q,(2)]? (mod p?). (5.7) 


From these congruences, Emma Lehmer derived the ones below: (Let me 
add that already in 1901, Glaisher had established (5.8) and (5.10), while in 
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1905, Lerch proved (5.8) and (5.9) for the modulus p.) 








(p- 1/2 1 D 5 1 5 e d 5 
Š poy a )— 5 PL def )]* (mod p°), (5.8) 
(p/3] 1 1 1 
2, 32 4,3) = 7 p[g,(3)]? (mod p?) when p > 3, (5.9) 
= 
[p/4) 1 3 3 
EECH =7 9,(2) S pla,(2)]* (mod p) when p > 3, (5.10) 
j=l 
SLL Lamm — pl etal? + 0,07?) (mod si 
poe 4” 3 9 PS glav AKG p 
when p > 5, (5.11) 
(p—1)/2 4 
> F = —2q,(2) + p[4,(2)|? (mod p°). (5.12) 
j=l 


Detailed proofs of these congruences will appear in my book quoted above. 

Combining the above congruences with the theorems of Wieferich and 
Mirimanoff, Emma Lehmer obtained various criteria given in (5A) below. 
As a matter of fact, the case n = 2 had been proved by Sylvester in 1861, 
while for n = 3 it was discovered by Lerch in 1905. Subsequently, Yamada 
proved the result for n = 3, 6 (see his second paper of 1941, correcting errors 
in the one of 1939). 


(SA) If the first case of Fermat’s theorem fails for the exponent p and if 


n = 2,3, 4,6, then 
[pin] 4 


2 


PROOF. Just use the theorems of Wieferich, Mirimanoff and the congruences, 
modulo p, (5.8), (5.9), (5.10), (5-11). O 


j = 0 (mod p). 


The next result, due to Vandiver, is much more difficult to derive. It is 
based on a formula due to Frobenius (1914) connecting Bernoulli numbers 
and the Mirimanoff polynomials. 

Let p > 2 be a prime, let m > 2 and let čo = LE En, be the mth 
roots of 1. Let p,(X) denote the nth Mirimanoff polynomial. 


Lemma 5.1. Let n > 1 andO<1<m-— 1. Then 
Wl Eë 


(mB + I" — B" =(—1)"n 2 ES? (mod p). 


I have used in the left-hand side the symbolic notation: B is an indeter- 
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minate, the left-hand side is computed as a polynomial in B and each power 
B“ is to be interpreted as being the Bernoulli number B,. 
A special case is 


m-i st ) 


(mB + 1)P-2 =2 > "ec (mod p). (5.13) 


Setting | = 0 yields a congruence obtained explicitly by Vandiver (1917): 


m" 





=f mat P 
—B=(- E ZE (mod j (5.14) 


Moreover, if n = p — 1 and py m, then 


m-1 


agin) = T 22E (mod py (5.15) 


With these formulas, Vandiver proved in 1925: 


(SB) If the first case of Fermat's theorem fails for the exponent p, then 
GE 
3 = =0(mod p). 
fett 
I totally omit his ingenious but long proof. 
In 1933, Schwindt was able to transform Vandiver’s criterion. He proved: 


Lemma 5.2. If p > 3, then 


ua) (wis) į 
5% =} z (mod p). 


j=1) j=1 
From this lemma, it follows: 


(SC) If the first case of Fermat’s theorem fails for the exponent p, then 
[p/6] 


» = = 0 (mod p). 
j=1J 


I will conclude the report on summation criteria for the first case, by 
giving my own results. The statements of these make use of the Bernoulli 
polynomials 


i=0 


BX% =F () oke (5.16) 


Their constant term is B,, the corresponding Bernoulli number. It is not 
my intention here to say more than strictly necessary about these polynomials 
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One striking fact is that for a = 4, 4, 4,844 8 there are formulas giving 
B,,(a), in terms of n and the Bernoulli numbers. Explicitly: 


sl = -(1 - = Ban (5.17) 
SE = Ban (3) = -} (i z z) Ban, (5.18) 
Ban L = Ban (3) =~ a (1 = Je, (5.19) 
Ban (5) = B,, (2) = dt — =) (: — zm) Ban- (5.20) 


However, no corresponding formulas are known for the values of Bo, + 1(X), 
say at a = 4,4, 4, 3, 3, only in terms of Bernoulli numbers. 

I also need to use certain other congruences proved by Emma Lehmer 
(with p > 3): 


EI 32-3 (p—2 l 
2 (p- 3P = CES SS pe pB,-3 — a DI (mod p?), (5.21) 


where p = s (mod 3) so s = 1 or 2, and 











p-2 


where p = t (mod 6), so t = 1 or 5. 
Using these congruences, I establish: 


[p/6] Gë? ee t 
KS (p D pi"? = Pe pB,-3 = a UI (mod p?), (5.22) 
j=l 


Lemma 5.3. If p > 3, then 


L. 5B, - 2) = B,-2(8) (mod p). 
2. az (— 1U + 1)3(2/*! — 5)B, = 0 (mod p). 


Lemma 5.4. If B,—(2n+1) = 0 (mod p), with 1 <n < (p — 3)/2, then 


3i 1 
> -ry = 0 (mod p) 
fest ft 
and 

pie) 1 


ja 


And from these lemmas, I conclude: 


(SD) If the first case of Fermat’s theorem fails for the exponent p, then 
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1. B,-2(}) = 0 (mod), B,- 2(8) = 0 (mod p). 
2. 2 D174 i? > = B,_ 2(s/3) (mod p°) and 2 528, j~? = B,_ ,(t/6) (mod p°), 
where s = 1 or 2,t = 1 or 5, p = s (mod 3), p= t (mod 6). 


Using the same method, I proved: 


(SE) If the first case of F ermat’s theorem fails for the exponent p, then 


GE 
% = = 0 (mod p) 
Teil 


and 
[p/6] 


3 Ek 0 (mod p) 
GC 
for r = 1, 3, 5, 7, 9, 11, 13. 


Note that the case r = 1 was already stated in (5A). Note also that using 


Krasner’s theorem (2A), if p is sufficiently large the last congruences must 
also hold for r odd, 1 < r < 2¥/log p. 


6. Fermat Quotient Criteria 


The important theorems of Wieferich and Mirimanoff were extended by 
various authors to Fermat quotients with other bases: q,(m) = (m?! — 1)/p. 
In 1914, Vandiver proved: 
(6A) If the first case of Fermat’s theorem fails for the exponent p, then 


4p(5) = 0 (mod p) 
and also 


ay eatin eee 0 (mod p) 
re er 
2 3 Cas i 


In the same year, Frobenius proved: 
(6B) If the first case of Fermat’s theorem fails for the exponent p, then 


g,(11)=0(modp) and Gp(17) = 0 (mod p). 


Also, if p = —1 (mod.6), then q,(7) = 0 (mod p), q,(13) = 0 (mod p), and 
9,(19) = 0 (mod p). 


New advances were made by Pollaczek in 1917 when he proved: 


(6C) If the first case of Fermat’s theorem fails for the exponent p and if m 
is any prime, m < 31, then with the exception of at most finitely many primes p, 


g,(m) = 0 (mod p). 
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(6D) If x, y, z are integers, not multiples of p, such that x? + y? + z? = 0, 
then x? + xy + y? # 0 (mod p). 


These theorems are difficult to prove. 
In 1931, Morishima completed Pollaczek’s result (6C), showing: 


(6E) If the first case of Fermat's theorem fails for the exponent p, then, 
without any exception, q,(m) = 0 (mod p) for all primes m < 31. 


As a matter of fact, Morishima proved also that q,(m) = 0 (mod p) for 
m = 37, 41, 43, with finitely many exceptional primes p. Rosser wrote two 
papers in 1940 and 1941, in which he eliminated the exceptional primes in 
Morishima‘s result. However, in his doctoral thesis in 1948, at Bossers 
suggestion, Gunderson thoroughly examined the paper of Morishima and 
found large gaps in the proofs. He proceeded to correct whatever he could 
and so he succeeded in establishing on firm ground (6E) above. But Agoh and 
Yamaguchi, who are presently visiting Queen’s University, and have worked 
with Morishima, assured me that Morishima’s proof really has no gaps. 

Later, in my eleventh lecture, PI indicate how the Lehmers, Rosser, and 
Gunderson used these criteria involving Fermat’s quotient to assure the 
validity of the first case of Fermat’s theorem for quite an extended range of 
the exponent p. 

Now I want to indicate some other applications of these criteria involving 
Fermat’s quotients. In a way analogous to the proof of (4A), Spunar proved 
the following result in 1931; it was rediscovered twice, by Gottschalk in 1938 
and Ferentinou-Nicolacopoulou in 1963. 


(6F) Let p be an odd prime, and assume that there exists k, not a multiple of 
p, such that kp = +m + n, where each prime factor of m and of n is at most 
equal to 43. Then the first case of Fermat's theorem holds for p. 


Proor. Otherwise, by the above criteria m?~!=1(modp’), n°7! =1 
(mod p?) and this contradicts Lemma 4.1. Oo 


For example, this gives a direct way of seeing that the first case holds for 
p = 1093 and p = 3511 since 


1093 = 2? + 3? x 11? 
and 
3511 =11 +2? x5? x7. 


In fact many numbers may be written in the form indicated in (6F). But, 
is this set infinite? This is an open question. Just to comment on this problem, 
I want to compare it with a very useful classical theorem of Bang (1886), 
Zsigmondy (1892), Birkhoff and Vandiver (1904). This theorem says the 
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following: 


(6G) If a> b>1 and gcd(a,b) = 1, then for every n> 2, a +n” has a 
prime factor p, which does not divide any of the numbers a” + b" (with corre- 
sponding signs) and 1 <m < n. The only exceptions to this statement are 
trivial, namely 2? + 1, 26 — 1, (2°7* + h)? — (2°! — h} for h odd, 2°-' >h. 


So, taking a < 43, there are an infinite number of distinct primes p,, with 
some k, such that k,p, = a" — b" (for example). However, there is no guar- 
antee that p, does not divide k,. So these primes are not necessarily of the 
kind indicated in (6F). 

In 1968, Puccioni examined what would follow if the set P43 of all primes 
of the form in (6F), is assumed to be finite. 

For each prime p let 


M, = {l prime|p'”' = 1 (mod !°)}. 


Numerical computations show that the sets Æ, are quite small. In fact 
very few examples of such numbers are known. 
Puccioni’s theorem says: 


(6H) If P43 is a finite set, then for every prime p < 43, such that p # 
+1 (mod 8) (i.e., p = 2, 3, 5, 11, 13, 19, 29, 37, 43) the set M, is infinite. 


This seems quite unlikely, however nothing has as yet been proved to 
the contrary. The offshoot of this discussion is, once more, that it is not yet 
known, even for the first case, whether Fermat’s theorem holds for infinitely 
many prime exponents. 

To conclude this section, I should mention that there are interesting 
papers of Lerch (1905) and Johnson (1977) about the properties of the Fermat 
quotient and the vanishing of these quotients modulo p. 
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LECTURE IX 


The Power of Class Field Theory 


In 1912, Furtwängler used class field theory to derive two important criteria 
about the first case of Fermat’s last theorem. As corollaries, he then gave 
new proofs of the theorems of Wieferich and Mirimanoff. In this way, the 
methods of class field theory entered into the game. 

It is my intention in this lecture to give a succinct overview of those parts 
of the theory which are relevant to Fermat’s problem. In no way will I 
attempt a systematic treatment of the theory. The advantage of this approach 
is that it brings directly into focus the tools that are most useful. Usually 
these lie hidden behind more refined statements. 


1. The Power Residue Symbol 


Let p be an odd prime, let ¢ be a primitive pth root of 1, K = OCL A the ring 
of integers of K, and 4 = 1 — ¢. 

The first fact, which is the cornerstone of the theory, is quite simple and 
I have already mentioned it in Lecture VII, §5. 


(1A) If Q is a prime ideal of A, Q # A}, and if ae A\Q, then there exists a 
unique integer a,0 < a < p — 1, such that 
aN- 1p — Co (mod Q). 
N(Q) denotes the absolute norm of Q, and I already noted before that 


N(Q) = al, where f is the order of q modulo p (q being the only prime such 
that Q divides Ag). 
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In view of the above result, the following definition makes sense: 


A ={ (1.1) 
when Q ¥ A), « e AQ. 


The symbol {—} has the following properties: 


(1B) If x, Be AQ, Q # AA, then 
a. « = ß (mod Q) implies {a/Q} = {B/Q}. 


b. {1/0} = 1. 
c. {4/Q}{B/Q} = {aß/Q}. 
d. Lang) = 1. 


This justifies calling the mapping o e A\Q > {a/Q}, the pth power residue 
character or symbol defined by the prime ideal Q # AA. 

By multiplicativity, the symbol may be defined for all ideals J of A such 
that AAJ. If J = [ JE, Of, and ifa € A, ad (Jr, Q; (or in other words, if 


gcd(Au,J) = A), define 
a m o fi 
PAS e 


kl 


For the extended symbol, the properties are analogous: 


In particular 


(1C) If the symbols below make sense, then 


a = B (mod J) implies {a/J} = {B/J}. 
DUU) = 1. 

{EIERN} = {aßl9). 

{oP/J} = 1. 

{II = {a/J loft, 


Ifa, ße A\AA, and ifgcd(Aax,Aß) = A, then the following notation is used: 


(aj G) 


I recall the notion of a semi-primary element from Lecture V: Any element 
a ¢ AA such that there exists me Z for which a = m (mod AJ’). I now list 
in a lemma the main properties of semi-primary integers, of which the first 
two were quoted before: 


ene E 


Lemma 1.1. 


1. If x e A\AA, there exists a root of unity © such that Ca is semi-primary. 
2. If a, ße A\AA are semi-primary, then so are aß and «/ß. 
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3. If «x e A\AA, then a? is semi-primary. 
4. If ae A\AA and g is a real number, then « is semi-primary. 
5. If a is a unit and « is semi-primary, then a is a real number. 


The main theorem for the Legendre and Jacobi symbol is Gauss’s qua- 
dratic reciprocity law. This is because it allows the computation of the 
symbol. Gauss also proved the reciprocity law for the biquadratic symbol 
and later (1844) Eisenstein discovered the reciprocity law for the cubic 
residue symbol. A more elementary proof was published recently by Kaplan 
(1969) for the biquadratic case and by Hayashi (1974) for the cubic case. 

The more general reciprocity law for the power residue symbol was the 
object of deep studies by Eisenstein (1850) and later by Kummer (from 1850 
to 1887). 

Using the Jacobi cyclotomic functions, Eisenstein proved the following 
special reciprocity law for the power-residue symbol: 


(1D) If mis an integer, p { m, if « € A is semi-primary, and gcd(Aa,Am) = A, 


then 
ike ee fia 
a = m 
A proof, in modern notation, may be found in volume ITI of Landau’s 
book (1927). 


The following corollaries are required in the applications to Fermat’s 
theorem: 


(1E) Let m be an integer, p { m, let a e A be semi-primary, gcd(Aa,Am) = A. 
If Aa is the pth power of some ideal of A, Ax = J”, then {a/m} = 1. 


Similarly, 


(1F) Let m be an integer, p¥ m, let « e A\AA, and gcd(Aa,Am) = A. If a 
is a real number, then {a/m} = 1. 


(1G) If q is a prime, q # p, and if {t/q} = 1, then q?~' = 1(modp?). 


The last result is used directly in studying Fermat’s theorem. The proofs 
of these corollaries are also in Landau’s book. 


2. Kummer Extensions 


In the study of the cyclotomic field K = Q(¢) (¢ a primitive pth root of 1, 
p an odd prime) there comes a time when it is necessary to study the cyclic 
extensions of K having degree p. Kummer considered such extensions and 
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he showed that they are of the form L = K(x/«), where « e A is not the pth 
power of any element of K. 

Let B be the ring of integers of L. Let o be the generator of the Galois 
group of L|K such that alga) = Cafe. 

The problem to investigate was the decomposition of (nonzero) prime 
ideals P of A in the extension L|K. Writing BP = P(PS--- P,, where 
P,,..., P, are distinct prime ideals of B and the exponents are equal, e > 1, 
the problem is the determination of e, g and f, where the relative norm of 
P, is Nijx(P;) = PI Since efg = [L:K] = p, the only possibilities are as 
follows: 

e=p, f=9=1 P is ramified, 
f=p, e=g=l P is inert, 


g =P, e=f=1 P is split. 
The situation is completely known. 


Case I. P| Aa. 
Writing Aa = P'J, where P ¥ J, then: 


a. Ifp +t, then BP = PP, where P is a prime ideal of B, so P is ramified. 
b. If plz, then there exists a € A such that L = K(2/’) and P y Aa’. 


Case II. P% Ax and P # A4 


a. If there exists ße A such that a = f” (mod P), then P is split. 
b. Otherwise, P is inert. 


Case III. P = AA X Ax. 


a. If there exists ße A such that a = £” (mod Ad?*'), then AA splits. 

b. If there exists ße A such that a = ß? (mod AAP), but there does not exist 
y e A such that a = y” (mod AA?*"), then AA is inert. 

c. If for every Be A, « # ß (mod A4”), then Ad is ramified. 


The above results may easily be rephrased in terms of the power residue 
symbol. Assume that P # AA is a prime ideal unramified in L|K, where 
a ¢ P. Then {a/P} = 1 exactly when P splits in L|K, and, more generally, 
the order of {a/P} in the cyclic group of roots of 1 is equal to the inertial 
degree f of P in the extension L|K. 


3. The Main Theorems of Furtwangler 


These are just two theorems which brought class field theory as a method of 
studying Fermat’s theorem. 
Furtwangler first showed (1912): 
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(3A) If p is an odd prime and x, y, z are relatively prime integers such that 
xP + y? + 2? = 0, ifr is any natural number, p X r, p X z, gcd(r,z) = 1, then 


> + = -L 
r 


His first main theorem is the following: 


(3B) If p is an odd prime, x, y, z are relatively prime integers such that 
x? + y? + 2? = 0, and r is a natural number such that r|x, pX x, thenr?~1 =1 
(mod p°). 





PROOF. It may be assumed, for example, that pt: Since pr and r is re- 
latively prime to y and to z, then by (3A) 


pao 
r 


But (x + 7 *y = {~ *y (mod Ar), hence by (1C) 


EAE 


By (IF), {y/r} = 1, hence (ri "sl Since p} x, {t/r} = 1, hence by (1G) 
LI 





r?! = | (mod p°). 


The proof of the second main theorem of Furtwängler is not much more 
difficult: 


(3C) If p is an odd prime, x, y, z are relatively prime integers such that 
x? + y? + z? = D and r is a natural number, such that r|x — y, p¥ x? — y?, 
then r?r"! = 1 (mod p°). 


With these theorems, Furtwängler obtained both theorems of Wieferich 
and Mirimanoft in a very natural way. Here is the shortest proof of Wieferich 
criterion (backed however by considerable theory): 


Proor. If p #2 and x? + y? + z? = 0, p} xyz, ged(x,y,z) = 1, then at least 
one of the integers is even, say 2|x. By (3B), 2?~! = 1 (mod p°). o 


And now, a very short proof of Mirimanoff’s theorem. 


Proor. If 3| xyz, then by the first theorem of Furtwängler 3°"! = 1 (mod p?). 
If 3/4 xyz, then x? = x (mod 3), y? = y (mod 3), and z? = z (mod 3). So 
O=xP + yP+ 7? =x+y+z2= +1+1+1 (mod3). Therefore X=y=z 
(mod 3). 
Note that p does not divide all three numbers x — y, y — z,z — x, otherwise 
O=x? +y? + 2? =x +y+z= 3x (modp). Since p #3, p|x, against the 
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hypothesis. Without loss of generality p/ x — y. Since —z = x + y (mod p) 
and pz, then pfx+y. But 3|x — y, so by the second theorem of 
Furtwängler, 3’~! = 1 (mod p°). m 


In view of their importance, the theorems of Furtwängler were proved 
again, extended and generalized. 
Noteworthy are the two theorems proved by McDonnell in 1930: 


(3D) Let p be an odd prime, let x, y, z be relatively prime integers such that 
xP + yP + 2? = 0). 


1. If p¥ xy + yz + zx and r|x? — yz, then r?~! = 1 (mod p°). 
2. If pX x(y — 2)(x? + yz) and r|x? + yz, then r?"' = 1 (mod p°). 


These theorems are not easily applicable, because the assumptions cannot 
be verified in practice. 

In 1919 Vandiver gave the following application of Furtwangler’s theorem. 
The same result, assuming that the first case fails, was indicated in Lecture IV, 
(3A). 


(3E) If p #2 and if x, y, z are relatively prime integers such that 
xP + y? + z? = 0, then 
x? = x (mod p°) 
y’ = y (mod p°) 
zP = z (mod p°) 


and x + y + z = 0 (mod p°). Moreover, if p|z, then p° |z. 


Furtwängler also proved the following result which parallels the theorem 
of Sophie Germain: 


(3F) Let p #2 and q = kp + 1 be primes. Assume: 


1. The first case of Fermat’s theorem holds for every prime exponent dividing k. 
2. The congruence XP + Y? + Z? = 0 (modg) has only the trivial solution. 


Then the first case of Fermat’s theorem holds for p. 


4. The Method of Singular Integers 


While writing his important papers in class field theory, Furtwängler in- 
troduced the singular integers. These were later used by Takagi, Fueter, and 
Inkeri to give new proofs of the two main theorems of Furtwängler. These 
proofs ironically avoided class field theory, or more precisely, the use of 
Eisenstein’s reciprocity law. 
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I keep the same notations: p # 2, ¢, K, A, 4. An element ae A isa singular 
integer if there exists an ideal J such that Aa = J? (it is possible that J be a 
principal ideal). More generally, I shall consider elements a such that Aa = J“, 
where J is some ideal and a is odd, a > 1. 

Let g be a primitive root modulo p, ¢:¢ +> (3 the generator of the Galois 
group of K IO For every i > 0 let g; be the unique integer such that 1 < 
gi < p — 1 and g; = g' (mod p). Similarly, for every i < Oletg;,1 <g;< p-—1, 
be defined by g,g~' = 1 (mod p). Let h; = (gg; — 9;+ ı)/p for every ie Z. So 
hez. 

Consider the polynomials in a: 


p-2 
G(o) = H Dë (4.1) 


p-2 
H(o) = E hø. (4.2) 


Following Inkeri (1948), I will describe the action of these polynomials 
on the generalized singular integers: 


(4A) Let oe A and assume that Aa = J*, where J is some ideal and a> 1 
is odd. Then: 


1. xO = DÉI. where <u<p-1,ßeA. 
2. oH) = Dn. where <v<p-1,yeA. 


Actually, these results had been obtained for a = p by Fueter (1922) and 
Takagi (1922), but their proofs, contrary to Inkeri’s, used some facts from 
class field theory. 

Inkeri (1946) also proved a more precise result, under stronger conditions: 


(4B) If xe A is semi-primary, and Aa = J“, where a is odd and p|a, then: 


1. a0 = Be with Be A. 
2. aM) = ag with y € A. 





With this method, Fueter in 1922 gave a proof of Kummer’s congruences 
(explained in Lecture VID). As he stated in his paper, he wanted to show: 
“,.. how all the necessary theorems to derive Kummer’s congruences are 
found already in the Zahlbericht [of Hilbert] and how, with a few strokes, 
Kummer’s criterion arises. Yes, and still more! Also Mirimanoff’s form and 
the criterion of Wieferich-Furtwangler may be immediately obtained, and 
therefore for the latter, it is required much less than Eisenstein’s reciprocity 
law”. 

Inkeri also gave in 1948 another similar proof of Kummer’s congruences 
for the first case. 

In 1857, Kummer obtained a formula, which was extended by Fueter 
(1922). 
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(4C) Let « € A be such that Aa = J? (a is a singular integer). Then for every 
a=1,2,...,p— 2, ae Daf where O< u, < p — 1, Ya€ A and 


F | ai -j 
1,(0) 2 (| ` ` a, 


In 1948, Inkeri gave new proofs of Furtwängler’s theorems, using the 
method of singular integers. 

In 1933, Moriya extended the theorems of Furtwängler for Fermat’s 
equation with exponents p°, n > 1. In 1946, using the method of singular 
integers, Inkeri gave a new proof of Moriya’s theorem, and extended the 
theorems of McDonnell for the prime-power exponents. 

These are Moriya’s theorems: 





(4D) Let n> 1, let p 4 2 be a prime, and assume that there exist relatively 
prime integers x, y, z such that x?" + yP" + zP" = 0, Let r be a natural number, 
satisfying any one of the following conditions: 


1. r|x, pyx 
2. r|x= y, př x? — y? 
Then r?"! = 1 (mod p"*'). 


Again, with the same method, Inkeri (1946) gave the following generaliza- 
tion of Vandiver’s theorem (3E): 


(4E) If p42, n> 1, if x, y, z are relatively prime nonzero integers, such 
that x?" + yP" + z?” = 0, then 


x? = x (mod p?"*}), 
y’ = y (mod p***"), 
zP = z (mod p”"+ +). 


5. Hasse 


I have already described the success of Furtwängler using Eisenstein’s 
reciprocity law for the power residue symbol. This is nothing more than a 
very special case of the general reciprocity law for this symbol, as it was 
developed by Kummer (1850, 1852, 1858, 1859, 1859, 1887), Furtwängler 
(1909, 1912), and later subsumed under Takagi’s (1922) and Artin’s (1927) 
reciprocity laws for arbitrary abelian extensions. These matters are explained 
with great care by Hasse in his Bericht (1927, 1930). 

As often happens, it is not obvious how to deduce explicit consequences 
from a theorem which is very general. Hasse’s great contribution was to give 
a very convenient form to the general reciprocity law for the power residue 








5. Hasse 173 


symbol. And from this, he could prove, in a more systematic manner, many 
of the theorems previously discovered by Kummer, Furtwängler and others. 

I will not state here the general forms of any of the reciprocity laws. 
Instead, PH select and present only those explicit reciprocity formulas used 
by Hasse. 


(SA) Let «, Be K = Q(¢), «#0, B #0. Let F, and F be the conductors 
of the extensions K(%/«)|K and K(2/B)|K. If gcd(Aa,F,) = gcd(AB,F,) = 
gcd(F,,F,) = A, then {a/B} = {Pla}. 

(5B) Let «, Be K, a #0, p #0. If gcd(Aa, AP) = A, and if AA is unramified 


and each real place of K splits completely in K(x/@)| K (or in K(x/B)|K), then 
{a/b} = {B/a}. 


Another form of the reciprocity law proved by Hasse, in volume II, 
page 77, is the following: 


(5C) Let a ße A be such that gcd(Aa,Aß) = A. Assume also that « = 
1 (mod Aj?~'), ß = 1 (mod AA) and that a is totally positive. Then 


a 


If p = 2, and the symbol is the Jacobi symbol (—), the above becomes 
Jacobi’s reciprocity law: 
If a > 0, if a, b are odd, relatively prime integers, then 


a b\ _ yoo 
ER 


As a corollary of (5C), it is easy to show: 





(5D) Let a e Z, ße A, and assume that gcd(Aa,AB) = A. Assume also that 
p does not divide a and B = 1 (mod AA). Then: 


a) II) it, (Bot 
ye ncn 


As the conditions become more restrictive, the formulas are more special: 





(SE) Let ae Z, Be A\AA, and gcd(Aa,AB) = A. Assume that p ¥ a and let 
be Z be such that Op = 1 (mod AA?) (see Lemma 1.1). Then 


Biel 


I note that Eisenstein’s reciprocity law is nothing but a particular case of 
the above result. Indeed, if $ is semi-primary, say H = m (mod 44?), with 
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me Z, then BP"! = mp) = 1 (mod AA?). Applying (SE) to a and f?~* yields 


a peo? “ro 
ra foe” = 


ID Dr =({° where c(p — 1) = 0 (mod p) 
fj la 


hence c = 0 and {a/f} = {B/a}. 
With the above reciprocity laws, Hasse computed the values of various 


symbols. 








(SF) If o e A is totally positive and o = 1 (mod A4" TL then 


-ee 


(5G) If ae A, a = 1 (mod AAP), then 


f _ el) and f = golar) 
Ki Ki 


The next expressions involve the A-adic logarithms of Kummer, which 
I have already considered in Lecture VII, §5. As a reminder, if ae A and 
a = Í (mod AA), then the A-adic logarithm is defined by 





o — (wl 
log, a = o ? (a — 1)’, 


n=1 


the series being convergent in the A-adic topology; so log, « is an element of 
the completion K of K, relative to the valuation v, belonging to the prime 
ideal AA. 


Artin and Hasse proved in 1925: 
(5H) If ae A and « = 1 (mod AA), then 


H Vlog, o E? log, a 
Ge ’ and fel =: Tol D ) 
& D 


where Tr, denotes the trace in the extension K|Q,. 





Also in 1925, Hasse proved: 


(SI) If «eA is totally positive, gcd(Ax,Aß) = A, and « = B (mod 44°), 
then {a/B}{B/a}~* = £°, where 
Po , Tra ‘log, 0) x Tr,(C' log, ol 


EK 


i=1 P p 
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I note that c is a A-adic integer, hence it is of the form c = co + ¢,A + 
c24? + +++ with c e Z. The power ¢ is defined to be equal to o 

Another way of expressing the product in (5I) was discovered by Takagi 
in 1927. It generalizes a previous formula by Kummer (1852) for the case of 
a regular prime p. To explain Takagi’s formula, let g be a primitive root 
modulo p, let o:¢ — (8 be the corresponding generator of the Galois group 
of K|Q. For every j = 1,..., p, let 


oP —1 


Kj= (1-2? dei 
In particular, x, = ¢. Then, it may be observed that 
vi = ] (mod Air!) 
kj = 1 — X (mod 447+ t) 


forj=1,...,p. 
Moreover, every « € A, « = 1 (mod AA) may be written in a unique way in 
the form 


“a= Ki). oe KPa {OK (mod AjP*}), 


where each t,(«) is an integer. Takagi’s formula is the following: 


(5J) If «x, B € A, gcd(Au,AB) = A, and a = 1 (mod AA), B = 1, (mod AA), then 


EK 


u= F Diet, KB) 


j=1 


where 


Kummer’s formula is in terms of the logarithmic differential quotient, 
which I have introduced in Lecture VII, §5. Namely, if u is a real variable, 
«e A\AA andj =1,2,...,p— 1, then 


1N(q) = d logale") . 
du’ u=0 


These are p-integral rational numbers and they are related as follows to the 
above exponents t,(«): 


AN 
j! 





t(a) = Hie) (mod p). 


This gives Kummer formula: 


(5K) If a, Be A, gcd(Ax,Aß) = A, and « = 1 (mod AA), $ = 1 (mod AA) 


then 
a) (B)7} w 
BE =e 
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where 
PI a . 
w= E (= AIND) 
j=1 
As a corollary: 


(5L) If x € A, « = 1 (mod AA), then 


B _ {Ttr - 10) — CD "ai, 
& 


It follows from the definition 


ge 
D 


NL mod p). 
p 


that 


t,- (0) = IP" a) = 


Similarly, the following formula is due to Kummer (1858) and Takagi 
(1922): 


(5M) If xe A, « = 1 (mod A2), then 


B = (iv 
o 


and moreover t,(«) = Tr,(A~? log, al (mod p). 





Also: 
(5N) If ae A, as 1 (mod AR), then {A/a} = {r®*?, where 
r=? (—1}B; 2-3 (—1}B; o, 
b= 3 ( k j tp- ;(@) = 3 oe DR" tat (mod p). 
gal ` j=l 


In particular, if e = — A8 '/p, then {e/a} = 67°. 


After this review of the main formulas for the power residue symbol, as 
they may be found in volume II of Hasse’s Bericht, I will now describe their 
applications, which may be obtained by computing the power residue symbol 
for appropriate elements. 


Assume that x, y, z are nonzero integers such that x? + y? + z? = 0. So 
p-1 


(-2P = II (x + Cy). 
Let a = (x + {y)/(x + y), so i=0 
«=1-(yXx+y)A=1 (mod AA). 


I write t = p(x + y) = —y/z (mod p). Since Aw = J’ for some ideal J of A, 
then for every ße A, ß # 0, it follows that {B/a} = 1. By considering various 
elements $, different conditions may be obtained. 


eng 
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Taking B = ¢ leads to the criterion: 


(50) With above notations, 
pl 


A ` = 0 (mod p). 


Taking H = p gives the following: 


(5P) With above notations, 
P t 


1 
3 ; Txo (=) =0 (mod p). 


i=1 
From ß = A, it follows: 


(50) With above notations, 
ri = 1 (mod p°). 


From $ = g = — ABT Tip, the following may be deduced: 


(5R) With above notations, 


(p 3)/2 . : 
5 = 12-24) = 0 (mod p). 
J 


j=1 


For the first case, the logarithmic differential quotients are expressible in 
terms of Mirimanoff polynomials: 


. 1 
UI = . 
Tal = me gid (mod p) 
forj=2,...,p — 2, where u = t/(t — 1) = —y/x and 


pal . 
eX) = 3 NIT 
i=1 
The above criterion takes a form similar to Kummer’s congruences (for 
the first case): 
P7342 B,, 


E P,-2j(u) = 0 (mod p). 


j=l 
To obtain Kummer’ congruences, put £ =j(1—O/(1— ©) (for 
j=1,...,p — 1). This gives, after some calculations, Kummer’s congruences 
for the first case: 
B,jl?~?(a) = 0 (mod p) 


or equivalently, B,,P,-2,;(u) = 0 (mod p) (for j = 1,2,... (p — 3)/2). 
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Taking $ = « (for i=0,1,... 
for the first case: 


‚p — 2), gives Mirimanoff’s congruences 


Diop" (a) = 0 (mod p) 
or equivalently, 
P(u)p,- (u) = 0 (mod p) 


(for j=12,...,(p- 1)/2). 

The first theorem of Furtwängler is obtained by considering ß = I, where 
lis a prime, ! # p, and I|xyz. Then /?~* = 1 (mod p°). 

The. second theorem of Furtwängler is obtained similarly, taking ß = l, 
where | Æ 2, [|x — y, 14 p. 

As I have already stated, the theorems of Wieferich and Mirimanoff follow 
at once from the above theorems of Furtwängler and therefore may be 
deduced by means of Hasse’s theory. 

In 1971, Gandhi performed some computations along the same lines; 
he proved: 





(5S) If x, y, z are relatively prime integers, p X xyz, x? + y? + z” = Q, if 
l is any prime such that | ¥ xyz, | # 1 (mod p), and if t = vie (mod 1), u = z/y 
(mod 1), v = y/z (mod J), then 


x + Cy = (itv 
l 3 

f + = = di 
jz + e = D "ett 
l 3 


fori=1,2,...,p—1, where q,(l) = (IP71 — 1)/p is the Fermat quotient. 








~ 





Gandhi also found some analogous expressions if the second case fails 
for the exponent p. However, he gives an erroneous deduction of Vandiver’s 
condition that if the first case fails for p, then 5°"! = 1 (mod p’). 


6. The p-Rank of the Class Group of 
the Cyclotomic Field 


In these lectures, I have often arrived at important conclusions from the 
study of the class number of the cyclotomic field. A more refined approach 
consists of studying the class group and examining its structure. More 
specifically, considering the p-primary component of the class group. 
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I introduce or recall some notation. Let h = p’t, b > 0, p Y t, be the class 
number of K. If J # 0 is any fractional ideal of K, let [J] denote the class of 
J, that is, the set of all fractional ideals of the form Aa.J, for some « # 0. I 
write J ~ J’ when [J] = [J’]. Let €, = €/ (K) be the maximal p-subgroup 
of € = 6/(K); it has order p? and it is equal to {[J]'|[J] € €}. 

Since €, is abelian and finite, 6,2 2, x '''x Zm this is the unique 
decomposition as a product of cyclic p-groups. Let Z; have order gr. so 
bi +t- + bm = b. The integer m is called the p-rank of €. 

I choose an integral ideal J; such that [J;] is a generator of #;. Thus 
(Lil = [A], but [JP "ls [A]. Hence, every ideal class [J] € €, may be 
written in unique way in the form 


[Y= (i Pind (6.1) 


with 0 < x; < p* — 1. The set {[J,],...,[J,,]} is a basis of @,. 

Then there exists J such that [J] = [1]? ifand only if p|x,,..., pl 

I denote by ef the subgroup of @, generated by [J{],..., LL Then 
Wës is a vector space over the field F, having dimension m. 

For each i = 1,..., m, let 9, be the subgroup of all [J] € €, such that in 
the representation (6.1), p|x;. Then #(G,) = PL 

If K’ is a subfield of K, I use similar notations: h' = p°t', a > 0, pt; €", 
@, stand for the ideal class group and its p-primary subgroup. 

The following group homomorphisms arise naturally: 


1. vg + @ defined by ı([J’J) = [AJ]; observe that 1(6,) c €, 

2. Nxxi@ > € defined by Neix([J]) =[Nxjx(J)], where J#0 is a 
fractional ideal of K and Nos is the relative norm; I note that 
Nex(@,) < , 


Let N = {(J]€@,|Nxix [J] = LAT Thus 4 = Ker(N x). A is also 
a finite p-group and has a basis. 
I assemble in a lemma the following almost obvious facts: 


Lemma 6.1. If py [K:K'], then 


. Lis injective. 

, No", > €, is surjective. 

-€,= WN x C, (direct product of groups). 

. rank(€,) = rank(/) + rank(@;). 

DILL. LI is a basis of €), then there exist ideal classes 
Umaih -o [Im] AC such that (LAPL.. [Ae] [Im +1): - LI 
is a basis of €,. 


rue 


After these generalities, let K = Q(¢), where ¢ is a primitive pth root of 1, 
p> 2. Let K* = DU + {7 }) be the real cyclotomic field. The notations are 
the following, and they are self-explanatory: €, €*, h = h*h*, h* = p™t,, 
aı 20, pt; h* = pt, a > 0, p¥ t; €, has order io, €} has order P. 
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N is the subgroup of all [J] € €, such that Nxix+({J]) = [A*]; it has order 
p, Cp =G} x N. Let e = rank(@>), e, = rank(/), so rank(@,) = e + eı- 

Hecke proved a very interesting relation between these ranks e, e,. His 
proof was based on the fundamental work of Furtwängler in class field 
theory, more specifically its connection with the general existence theorem 
for class fields. Whereas the full results of Furtwangler were not required 
by Hecke, nevertheless he needed the explicit description of the unramified 
cyclic extensions of degree p of K. Of course this in turn may be easily 
obtained from the existence theorem of class field theory. 

I describe the results obtained by Furtwängler in his papers of 1904 and 
1907. Let a1,...,%,€A. They are p-independent when the following 
condition is satisfied: if at. -am = pP, with Be A and 0 < r; < p — 1 (for 
i=1,...,m), then r; =: =r, =0. 

In 1859, while studying the reciprocity law for the power residue symbol, 
Kummer proved the following important theorem, which may be also found 
in Hilbert’s Zahlbericht §135, Theorem 152: 


(6A) Let o1,...5%m€A be p-independent. Let jı,...,jm be integers, 
O<jy,.--5jm<P—1. For every n=1,...,p— 1 let 2, be the set of all 
prime ideals Q, Q 4 Aa; (i = 1,...,m), Q # AA, such that 


ol. ji ml _ jm 
Ca dap oe 


1. If s > 1 is areal variable, then 


SEN 1 1 
m ——-=(p-1)l 
r 2, 2, NOT OT OOST 
where f(s) remains bounded in the neighborhood of s = 1, and N(Q) is the 
absolute norm of Q. 
2. The set RT 2, is infinite. 

Actually, Cebotarev proved in 1923 the density theorem that asserts that 
each set 2, is infinite. This was further extended by Schinzel in 1977 (see his 
Theorem 4). 

Now, let f =e + e, =rank(@,), let UL... Lili be a basis of €, 
Without loss of generality, each J; may be taken to be an integral ideal and 
ALY Ji. 

Let pP: be the order of [J;], and let p; € A be such that 


[Jil = Ap: 
1) be a fundamental system of units, let 
e, = Cande,,;=p;(fori=1,...,f). Then {eis - - < er are p-independent. 


By Kummer’s theorem, there exist prime ideals Q,,...,Q,+,, distinct from 
AA, such that Q; does not divide any of the ideals J,,..., Jp and for every 


Then 


+ f(s), 





I write x = 4(p — 1). Let {e,,... 





D 


EE eh 


ye 


EN oa 


e E eer 
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i=l,....0+f, {e,/0;} # 1, {&,/Q,} = 1 (for j # i). So the ideals Q; are all 
distinct. 
I write 
[RJS] H" 
with 1 < uf? < p®. Let w = pi — up, so QP --- Je Ax, where kje 
AAL 0 < wh) < phi — 1, pen ee j 
Let S be the set of all algebraic integers a of the form 


= pli... pln n 
o = EI So ot T KĘT, 


O0<u;<sp—1,0<0,;< p—1 and such that the following f congruences 
modulo p are satisfied: 


dE dps Wi = 0 (mod p), 


ww) +s + Vra WEN = 0 (mod p). 


Then Sc A\AA, and #($)> p"*!*" = p?“!*S, Also, if we sS, then Aw = 
JJ, JYOT Oeuf, where p|ny,..., p|ny. 

In 1902, Hilbert defined oa, ße A\AA to be of the same kind if there exists 
yeA such that « = By’ (mod Ad’). This implies that y ¢ AA. The above 
relation is an equivalence. 

Furtwängler then defined a e A\AA to be a primary integer if there exists 
Be A such that « = ß? (mod 44”). It follows that there exists b e Z such that 
a = b? (mod 44”). 

It is not difficult to show: 


Lemma 6.2. 


1. All primary integers are of the same kind. 
2. If B € A\AA is of the same kind as the primary integer «, then B is also primary. 
3. There are pP”! distinct kinds of integers in A\AA. 


Since S c A\AA has at least p?~'*/ elements, there exist w,, œ, E S, 
distinct but of the same kind. Let w = w,w$~'. Then w is a primary integer 
and it may be written in the form: 


w = gir E God gë Et 


with « € A,0 < u;, v; < p — 1 and at least one of the integers u;, v; is different 
from 0. In particular, œ is not the pth power ofan element of K. 

The determination of œ depends on the choice of the prime ideals 
Q., vee Or With another choice Q),...,Q/4,, another element w’ is 
obtained. It may be shown that œ, œ are p-independent. 

Since @ is nota pth power in K, K(z/a) |K is a cyclic extension of degree p, 
that is, a Kummer extension with Galois group generated by o" Elo > Ceo. 

With every unramified cyclic extension of degree p of K, Kẹ = K(2/B) 


(where Ge A, ß not a pth power in K) Furtwängler associated a subgroup 


182 IX The Power of Class Field Theory 


Ze of the p-class group €, as follows: 9, is generated by all ideal classes [P], 
where P is a prime ideal, P # AA, P + AB and {ß/P} = 1; that is, P splits in 
K,|K. Thus #(G,) < pt nt, 
In fact, it will be shown ultimately that Y, has order pr" "`. Pending this 
result, Tet H, be a subgroup of €, such that 9, c #, and #, has order 
To study the correspondence K, +> #,, Furtwängler used the following 
general lemma: 


Lemma 6.3. Let L be any algebraic number field with class group €¢(L) 
and class number h, = p"r,m> 0, p{r. Let # be a subgroup of order p""' 
of EL). Then 

1 1 1 


<-lo 
poe NF = pes 


where f(s) remains bounded in a neighborhood of s = 1. 





+ f(s), 


Applying this to K, as well as to K*, Furtwängler showed that if Kg, Kg: 
are distinct unramified cyclic extensions of degree p of K, then Hy # Hg. 

Before stating the main theorem, I recall that K,|K,...,K,,|K are 
called independent extensions if none is contained in the composite of the 
others. Furtwängler’s main theorem is the following: 


(6B) 
1. K has f independent unramified cyclic extensions of degree p. 
2. Any f + 1 unramified cyclic extensions of degree p must be dependent. 


It follows from this theorem that #, = G,. More specifically: 


(6C) K has e independent unramified cyclic extensions of degree p, say 
K; = K(£/o,), with w0; = B?, where ß,e A*. 


Based on this theory, Hecke could prove (1910)—and this was certainly 
easier than what Furtwängler did—the following: 


(6D) 


1. There exist ideal classes [H,], . . . , [H.] E€ €, such that 
a. Nxix+({Hi]) = [A+] fori=1,...,¢2 
b. H? = Aen, where œ; is a primary integer. 
c. If [H J" -- - [H.]" = [A], then p|m,,... 
2.e<se,. 


‚plm.. 


I observe that this generalizes Kummer’s theorem that if p divides h*, 
then p divides h*. 
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It is interesting to see the implications of this theorem to Fermat’s problem. 
Assume that there exists o, ß, y e A\AA such that of + B? + y? = 0. Hecke 
showed that the above implies that e, — e > 1. Previously, in 1857, Kummer 
had shown that p? divides h = h*h*. It follows that p?|h*. Indeed, if p|h*, 
then rank(@7) = e > 1 and e, > e + 1 > 2, thus p?|h*; while, if p / h*, then 

lb. 
P GER proved that under the above hypothesis (failure of the first 
case in the ring A for the exponent p) then e, — e > 4 and therefore p*|h*. 

These results have been much improved by a totally different method 
invented by Vandiver. I will discuss this later in this lecture. 

To conclude this section, I shouldn’t pass over the beautiful reflection 
theorem of Leopoldt (1958). It is in fact a generalization of Scholz’s theorem 
of 1932. In Lecture XIII, Tl specifically need this theorem. I think it worth- 
while to describe, even though succinctly, Leopoldt’s theorem. In this, I 
follow the “exposé” by F. Bertrandias (1969). 

Let Y be the Galois group of K |Q, let @ be the group of characters with 
values in the group of (nonzero) p-adic numbers, namely, all the homo- 
morphisms 7:9 > Q,. Thegroupring Q,[%] contains a system of orthogonal 
idempotents: 

1 


ly SES 3 mie je 


oeg 


(for all y e 4) and moreover 


This induces a decomposition 
Q,[9] = © 9,1, 
XE% 


and similarly 
Z19] = © Zp. 
xe% 

Suppose now that V is any abelian p-group, say of order p", on which 4 
operates; I write v” for the image of v by o. This may be extended to an 
operation of Z [4] in the following way: if « = $ seg 4,0, with a, € Zp, let 
a,€Z,0 <a, < p"— 1 be such that «, = a, (mod p"). Then, the action of « 
on v is defined by 

p= I] (v7)%, 
geg 

Letting V, = {v'*|v e U}, then V, is a subgroup of U; ve V, if and only 
if v' = v, and in this case, v? = vu”. It follows also that V= [J eg Vy. _ 

Let reg complex conjugation. A character y such that y(t) =@ is 
even, while if y(t) = —1, it is odd. I note that if y is even, v" = v, while if x is 
odd, then v = v™! forall v e V,. An element v e V is invariant by t exactly if 
vEV* =[],even V,, While o sp "and only if v e V* TL oaa Vo 


x 
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The group Y operates on various interesting abelian p-groups. First, it 
operates on the group W ofall pth roots of 1. This is a cyclic group of order 
p, so there is a character y* of Y such that W = H. Namely, viel = 
Wm, 9°" (limit in the p-adic sense) where 7 = (9,1 < g < p — 1, C being a 
primitive pth root of 1. I note, therefore, that ( = (7°. 

Given any character y of Y, the character 7 = x ‘y* is called the reflected 
character of y. The reflection y ++ % in the group Ê is an involution, that is, 
x = x. Since x*(t) = —1,x* is an odd character and y, 7 have different parity. 

Another abelian p-group on which % operates is @,. So, for every character 
x of Y, I may consider the subgroup (@,),, whose rank is denoted by r, 

Leopoldt’s theorem relates the ranks r, and r,, for reflected St 
x and x. To bound the difference r; — r, , Leopoldt considered the action of 
% on the F,-vector space % = U/ UP, where U is the group of units of A (the 
ring of integers of K). The rank of % is (p — 1)/2. Consider now all units e, 
such that the extension K(x/e)|K is unramified. Their cosets eW? form a 
subgroup %, of A. For each character x of % let a. = rank(%,,,). 

Leopoldt’s theorem may now be stated: 


(6E) 
1. For every character y of G: — Sug < fy — Ty S Su 
2. If x is an even character, then r, < rz < r, 4 1 dnd 
3. e <e; <e + (p 3/2 


Of course the inequality e < e, is Hecke’s theorem. 


7. Criteria for p-Divisibility of the Class Number 


The general idea is quite simple: If the first case is false for the exponent p, 
to deduce that a very high power of p divides the first (or second) factor of 
the class number. Then by some means, like explicit computation, rule out 
this possibility, thereby assuring that the first case actually holds for the 
exponent in question. 

There are various papers of this kind in the literature. To begin with, I 
consider statements involving the first factor of the class number. For 
example, as I mentioned earlier, if the first case fails for p, then p ?|h* (Hecke) 
and, better, p*|h* (Furtwängler). 

The next results in the same line depend on a congruence for the first 
factor which was discovered in 1919 by Vandiver: 


(-1)P-DI2p 


Me A EEN 
nS 2@-372 


Ban (mod p”) 


1<si<p-2 


for n > 1. This congruence was proved again in 1966 by Hasse, using p-adic 
methods. A still simpler proof was given by Slavutskii in 1968. 
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In 1925, using the above congruence in appropriate ways, Vandiver 
showed that if the first case fails for the exponent p, then p® divides h*. 
Morishima (1932) sharpened this to p!?|h* (with two undecided primes) and 
Lehmer (1932) ruled out the exceptions. 

Whatever the virtue of these results, it would be much better to have some 
power p"® dividing h* = h*, where n(p) increases with p. 

In 1965, Eichler proved the following result: 


(7A) If the first case fails for p, then plv?!”! divides the first factor h*. 


The proof is one that relies only on basic principles and may be explained 
without much effort. In spite of the danger of making this lecture too long, 
I will now give Eichler’s clever proof. 


ProoF. If x, y, z are relatively prime integers, not multiples of p, and such 
that x? + y? + z? = 0 then, as said many times, there are ideals J; of A such 
that A(x + Cy) = J? (for i=0,1,...,p — 1). Let r = [Vp] — 1 and h* = pt, 
s>0, p/t. Assume that s < r. We will show this leads to a contradiction. 
By the pigeon-hole principle and the fact that p° is the order of the group 
N =@,/6,, there exist a;,..., a,,0 <a; < p—1, but not all a, equal to 
zero, such that J4!--- J@ ~ AJ*, where J* is some ideal of A" (the ring of 
integers of K* = Q(¢ + {7 IL Then 


II A(x + Çiy)“ SE US, å Sale = (Ap: AJ*)?, 
i=1 


where pe A; it follows that A(J*)? is principal, say A(J*)? = Aa with 
ae A". Then 


[I (x + Cy)* = patie, 
i=1 


where ¢ is a real unit (from the nature of units of the cyclotomic field). Since 

= (AA)P! and A/AA = F,, there exists be Z such that p = b (mod A). 
Then op = b? (mod pl Therefore | ];- , (x + C‘y)" = back (mod p). Taking 
the complex conjugates and dividing, 


r + e 
h(t) = (74 (mod p) 


therefore 





r x+ tly ER EE 
E : z) = (" (mod p) 
with n = 2j — f- ia;. Let 
x when a, z 0 
y when a; <0 
y when a}>0 
x when a; <0. 
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If Z is an indeterminate, let 


F(Z) = [] (x + Ziy) 
and Se 


G(Z) = [] (y; + Z'x,)'", 
i=1 


Then FU) = (GU) (modp) so there exists a o & 
: ‘ olynomial P 
coefficients in Z, degree at most p — 2, such that Penang 


FÜ) = (OU) + pP). 
This means that 
F(Z) = Z"G(Z) + pP(Z) + ®,(Z)O(Z), 


where ©,(Z) is the pth cyclotomic polynomial and Q(Z) has coefficients in 
Z, ee of Gauss’s lemma on factorization of polynomials 

ultiplying by 1 — Z, taking the derivatives, and setti 
leads to the congruence Br 


(= OEO = FO = -EGO at — IE) + - HECKE) (mod 2). 


Dividing by the preceding con i ici 
vidin gruence and comput 
logarithmic derivatives yield: EE 





une Pl ai - 9 ; pe 
(=O lala D ales Jali = (mod D 


From here, it follows that 





>= D sin; y 
1-9 2 od L SET an) = nl — ¢) (mod p). 


Let k > 1 be the smallest index such that : iplicati 
ee a, #0. Upon multiplication 


i (x + Deitz + tix) 


a congruence is obtained between polynomial e : 2 
i : xpressions in ¢, wh 
highest term is of the form 5 ¢, where the 


ayk(xy)"(x? Le Us ai 1)+1-k 


(m being a positive integer). Since r(r + 1) < p — 2, by the assumption 
gd. = and since IL. ..-,¢?~7} is an integral basis, the congruence 
estion gives x° = y* (mod p), ie, x = +y (mod p). B 
ý= +z (mod p). ( p). By symmetry, 
Hence from x+ y+ z=x? + y? + P= ; 
: = H + z = 0 (mod p) it foll 
against the hypothesis. Ge ows that p| 
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With a slight modification in the proof, the following is also true (but has 
not been explicitly published in this form): 


(7B) If the first case fails for p, then the p-rank of the class group 6 = CUK) 
is greater than /p — 2. 


Despite the interest of the above results, it is, in practice, not easy to 
determine the p-part of the first factor h*, or of the p-rank of the class group. 
Briickner obtained in 1975 the following more amenable result: 


(TC) If the first case of Fermat's theorem fails for p, then the irregularity 
index of p, ii(p) = {2j = 2,4,...,p — 3|p divides B,,}, is greater than Jp — 2. 


Unaware of the variant of Eichler’s theorem, Skula proved in 1972 that 
if the p-rank of the class group of K is 1, then the first case holds for p. A 
substantial simplification of Skula’s proof was published by Briickner (1972). 
But, in any case, this result is no more than a special case of the variant of the 
beautiful theorem of Eichler. 

Now, I would like to turn my attention to similar statements for the second 
factor of the class number. 

Carlitz established in 1968 a congruence between the first and second 
factors of the class number. Let m = (p — 1)/2, let g be a primitive root 
modulo p and Gy = det(g?”),, for O< j<m-2,1<n<m—I. Let 
SL... Em- (Z) be polynomials of degree at most p — 2, coefficients in 


Z, such that Tei)... &m—1(¢)} is a fundamental system of units of K. Then, 
aD P ygs 
Brent c ar > 
Erl) 2 i 
where Cys E Z. Put 
p-2 
Cyn = $ age forkn=1,...,m—1 
s=0 


and let C = dëtt, Finally, define G = 2"*?(C/Go). 
Carlitz showed that G is independent of the choice of the fundamental 
system of units and moreover: 


(TD) h* G = +h* (mod p). 


Note that this says that if p|h*, then p|h* (as Kummer showed), and 
conversely, if p|h*, but p ¥ h”, then pIG. 

In 1973, using p-adic methods, Metsänkylä proved a congruence for h*, 
which is actually equivalent to the one by Carlitz. 

In 1934, Vandiver stated and gave an abbreviated proof of the following 


theorem: 


If p does not divide h* , then the first case holds for the exponent p. 
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Referring to this theorem, Vandiver wrote: “I now give a sketch of a proof 
of a theorem which appears to be the principal result I have so far found 
concerning the first case of [Fermats] theorem.” 

Indeed, if this theorem is true, it is quite remarkable. However, L. C. 
Washington communicated to me in a letter that “Iwasawa has told me that 
there is an error in Vandiver’s proof which he has not been able to fix. 
R. Greenberg has also looked at it and arrived at the same conclusion. ... 
Anyway, the theorem should be regarded as questionable”. 

Therefore some of the results of Morishima’s papers XI and XII of 1934 
and 1935 are also unreliable (I have also discussed Gunderson’s criticisms in 
another lecture). 

If it can be established one day that, after all, Vandiver’s theorem is true, 
then the following simple proof of Skula’s result, as communicated to me by 
Hasse, will be justified: 

Suppose that the p-rank of the class group is 1, that is, €, is cyclic. Since 
Së, x M, then either €, or N is-trivial. Recall that if p divides h*, 
then p divides h*, as Kummer proved. Therefore, p cannot divide h*, other- 
wise both groups €, and A" are nontrivial. By Vandiver’s theorem (if it is 
true) the first case holds for p. 

I note that in the same year as Vandiver, 1934, Grün proved the following 
result, which though not as strong as what Vandiver claimed, at least is 
correct. 


(7E) If p > 3 is a prime not dividing the second factor h*, and if BA #9 
(mod p*) fork = 1, 2,...,(p — 3)/2, then the first case holds for the exponent p. 


8. Properly and Improperly Irregular 
Cyclotomic Fields 


Vandiver, following in the footsteps of Kummer, recognized the importance, 
for Fermat’s theorem, of the condition that p divides the second factor of the 
class number. 

In 1929, he introduced the following definition. A prime p is called properly 
irregular if p|h*, but p¥hj. It is improperly irregular if p|h}, hence also 
p|h* by Kummer’s theorem. 

Already in 1870, Kummer established that 2 divides h},, and bi, and 
3 divides hz,, and hłs7. However, Kummer found no example where plh,. 
In fact, no such example has yet been found, however there is no evidence 
to support a conjecture that p never divides h, (see Herbrand, 1932). 

The conjecture that p does not divide h} is commonly called Vandiver’s 
conjecture. Yet, it appears already in Kummere early work. Indeed, in a 
letter to Kronecker, dated December 28, 1849, Kummer stated that he thinks 
to base his proof of Fermat’s theorem for the exponent p on two properties. 
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The first one, “still to be established” amounts to the fact that p does not 
divide the second factor of the class number. In another letter to Kronecker 
(April 24, 1853), Kummer recognized the failure of his attempts to prove that 
p does not divide h}. l u 
In 1929 and 1930, Vandiver gave a necessary and sufficient condition for 
hi to be a multiple of p. This is expressed, as one would expect, in terms of 
the units of K. I recall my previous notations. 
Let g be a primitive root modulo p, let :¢ > (7 be the corresponding 
generator of the Galois group of K |Q. For every je Z, let g; be the unique 
integer such that 1 < g; < p — 1 and g; = g’ (mod p) (with the obvious con- 
vention when j < 0). Let 
WEE 
Riol = X g- aij 
j=0 


(for i = 1,2,...,(p — 3)/2). Let 
ES 
= Ex 


and finally, let &; = 6%, for i= 1,2,...,(p — 3)/2. Then: 








(8A) p divides h} if and only if one of the units ¢;(1 < i < (p — 3)/2) is the 
pth power of a unit in K. 


Vandiver also proved that the units £4, . . . , &)—3)2 form an independent 
system of units in K. 

A simpler proof of the above result was given by Inkeri in 1955. 

The following theorem of Vandiver, was first announced in a special case 
in 1930. The proof was published later in 1939. It is quite interesting, since 
it relates the irregularity index of p with the invariants of the p-class group 
of K. 

Morishima had essentially claimed this result already in 1933, but as 
Vandiver points out, Morishima’s proof is incomplete. 


(8B) Let p be a properly irregular prime number, let the p-class group of K 
be €, =Z, x ++: x £,, where each F; is a cyclic group, of order p” (i= 
1,...,f), b; > 1. Let m,, m2,..., mp be the indices such that 1 < m; < 
(p — 3)/2 and p divides B,„,. Then: 


1. f' =f and p” || Bom, (fori=1,...,f). e Së 
2. €, has a basis {{J,],... Heli such that [J;] has order p” and Jr" 
A(fori=1,...,f). 


These results are deep, the proofs quite involved. 
In 1976, using the theory of modular forms, Ribet proved a result which 
is intimately related with (1) of (8B) above. However, he did not assume that p 


divides h*. 
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Let I = @,/65, where €, is the p-primary component of the class group 
of K. Let y:Gal(K|Q)— F, be the character, with modulus p — 1, defined 
as follows. If g is a primitive root modulo p, or? the corresponding 
generator of Gal(K |Q), then y(o') = g’ for i = 0, 1,..., p — 2. Let Py) = 
{ye Tay) = x(o')y for i = 0,1,...,p — 2}. Then I'(y’) is a subspace of the 
F,-vector space I’, and if j =j’ (mod p — 1), then T(x’) = T(x”). Moreover 
r OO d F(x). 

Ribet proved: 


(8C) If 1<k <(p — 3)/2, then p divides Ba, if and only if I'(y'~**) #0. 


A better understanding of the structure of the class group and units of the 
cyclotomic fields is now being reached by means of the theory of p-adic 
L-functions, as illustrated in the recent work of Washington. 

I will now describe the method used by Washington to give a more 
systematic and transparent proof of various results by Pollaczek and Dénes. 

Borrowing from Iwasawa’s beautiful book (1972), I present the p-adic 
L-functions. Let y be a primitive modular character, with conductor f (see 
Lecture VI, §1). So ¥:Z > C, x(a) = 0 if and only if gcd(a,f) = 1, x(ab) = 
x(a)x(b), and a = b (mod f) implies x(a) = x(b). 

The generalized Bernoulli numbers B, , are defined as follows. If 


£ y(i)Te™ 
TV 





FAT) = 


the formal power series development of F,(T) is written as: 


Ka T" 
F(T) = Y B, 
n=0 n: 

In particular, if x = za, the principal character (with conductor f = 1), 
then B,,,, = B, (for n # 1), By, = -Bı =b. 

If L(s,x) = 4, (X(n)/n°) (for s > 1) and if y # Xo, the above L-series may 
be extended to a meromorphic function on the whole plane. For n = 1, 2, 
3, .. . it may be shown that L(1 — n, x) = —B,,,,/n. This is similar to a formula 
connecting the Bernoulli numbers B,, and the values of the Riemann zeta- 
function ¢(2s), once the functional equation for the Riemann zeta-function 
is taken into account. 

The aim in defining the p-adic L-functions is to have at our disposal, 
in the p-adic number field, a function which behaves like the ordinary 
L-function. Kubota and Leopoldt (1964) were able to construct such p-adic 
L-functions. 

I assume p > 2, but everything holds with slight modification when p = 2. 

The group U of units of the ring Z, of p-adic integers has the following 
structure: U x W x U, (group isomorphism and homeomorphism), where 
U, is the subgroup of all units « such that « = 1 (mod p) and W is the group 
of order p — 1 of all (p — 1)th roots of 1 in Q,. 
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Let w(a) = Dm, of for every ae U. Then w(a)e W and a) = 
a/w(a)e Uy. 

In particular, (oe Z <Z,, p ¥ a, then ae U, so w(a) is well defined; for 
the integers a, such that p|a, just put œ(a) = 0. Since the values of w(a) are 
algebraic numbers, hence complex numbers, the mapping w:Z > C is a 
primitive modular character with conductor p. 

If y is a primitive modular character with conductor f, for every n> 1 
let y, = yo". This is the product modular character, which is the unique 
primitive modular character with conductor dividing fp such that x,(a) = 
x(ajeo(a)~" for every integer a with gcd(a, fp) = 1. 

The theorem of Kubota and Leopoldt asserts the existence of a unique 
p-adic meromorphic function L,(s,y) such that the following two conditions 
are satisfied: 





i. 


+ > cas — 1)", 


L,(s,x) = i 
n=0 





C-1 
RS 


where each c; e Q,(x) (the field generated over Q, by the values x(a), for 


a=1,2,...,p — 1), 
1 
1-- ify=x, 
| P ° 


0 if x # Xo, 


and the series converges in the domain 
_ 1-2- 
f; e Q, = algebraic closure of Q,||s — Ip <p ?~ ` 


(| |, denotes the p-adic absolute value, normalized by |p|, = 1/p). 
ii. Forn = 1, 2,3,... 


B 
L, = ny) = (1 ~ xP) LCL nn) = -0 = Xap)" re. 


The functions L,(s,x) defined above are the p-adic L-functions. Thus, if 
X Æ Xo, L,(s,x) is regular at s = 1, while L,(s,yo) has a pole of order 1, with 
residue 1 — (1/p) at s = 1. . 

Now I shall require the p-adic regulator ofan algebraic number field, which 
is defined in a way similar to the ordinary regulator. For this purpose, I first 
consider the p-adic logarithm. Let « e Q, (algebraic closure of Q,) be such 
that |x — 1|, < 1. Then the series 

SE 
3 n 


n=1 


(o — 1)" 


is convergent and its sum is an element of Q, which is defined to be log,(q), 
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the p-adic logarithm of « (I have defined in Lecture VII, §5, the A-adic loga- 
rithm in similar way). The mapping log, may be extended in a unique way to 
the whole multiplicative group Q,, so as to be continuous and satisfies 
log,(t(«)) = 1(log,(a)) for every t € Gal(Q,|Q,) and log,p = 0. Note that 
Q is algebraic over Q,, hence it may be imbedded into Q,; I fix one such 
imbedding. 

Let L be a totally real number field of degree n, let p,,...,g, be the 
imbeddings of L into Q < Q,. Let {é,,...,¢,} be a fundamental system of 
units of L, where r = n — 1. 

The determinant of the matrix (log, 9;(€;));,;=1,...,, belongs to Q, and, 
up to a factor +1, is independent of the choice of the fundamental system of 
units. It is, by definition, the p-adic regulator of L, denoted by R,. 

Leopoldt conjectured that R, is always nonzero. This has been proved by 
Brumer (1967) when L | Q is a totally real abelian extension, and it is quite 
a deep theorem. In particular, if L = K*, its p-adic regulator is not equal to 0. 

Leopoldt’s p-adic class number formula puts together the various values 
of the p-adic L-functions at 1, the p-adic regulator and the class number h 
of K = DU: 

(p-3)/4 (p—3)/2 
_ ?P 


In particular, L,(1,07') 4 0 for each i = 1,...,(p — 3)/2. 

Dénes defined in 1954 the p-characters of the Bernoulli numbers as 
follows. If 1 < i < (p — 3)/2, the p-character of B,, is the smallest integer 
u; > O such that By; ,., # 0(mod p*“'**). In particular, p is regular if and only 
if the p-character of every Bernoulli number B,,(1 < i < (p — 3)/2) is equal 
to 0. 

Dénes raised the question whether such an integer u;, as above, necessarily 
exists. He promised a proof, but this promise remained unfulfilled. It was 
only in 1977 that Washington succeeded in showing that the p-character 
is always defined (as a finite integer). Precisely 


h L (1,007). 


u; = v,(L,(1,@7')) < œ, 


where v, is the p-adic valuation. The inequality reflects the fact that 
L,(1,0°) 4 0. 

The p-adic method of Washington was sufficient to obtain new proofs of 
results of Pollaczek (1924) and Dénes (1954) concerning fundamental systems 
of units for the real cyclotomic field. Because of its interest, I wish to mention 
the following result of Dénes (1954, 1956). 


(8D) There exists a fundamental system of units {nz Ma, . - - Mp-3} for K* 
of the form: 


ga = az; + b,,4° (mod A2 


with GEI bo; € Z, and 
Cz; = 2i + (p — Dun, 





ary 
2. 
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where 
0 < ud; < Uz = v(Lp(1,07')) 


for 2i=2,4,...,p—3. 


(8E) If p° is the exact power of p dividing h*, then 


(p— 3)/2 


a= > (uzi — Mac 


i=1 
In particular 


(8F) If p is properly irregular, then uy; = uz; for i=1,...,(p — 3)/2. 
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LECTURE X 


Fresh Efforts 


In this lecture, Pll present results obtained by various new methods. My 
choice is rather encompassing. There are some attempts, which belong among 
those described in my Lecture IV, on the naive approach. Others involve 
penetrating studies of the class group. And entirely new avenues are opening 
with ideas from the theory of algebraic functions. 

_ Whether or not these methods will solve the problem, they at any rate 
stimulate interesting investigations. 


1. Fermat’s Last Theorem Is True for Every Prime 
Exponent Less than 125000 


I announced in my first lecture that Fermat’s last theorem is true for every 
prime exponent p < 125000. Though, as I said earlier in Lecture VIII, the 
first case holds for p < 3 x 10°. ' 

Of course, the above verification has been made using modern computers. 
This is the result of the patient work of Wagstaff, but—as he assured me—not 
using the most modern computers available today. Instead, the majority 
of the programs were run on a ten-year-old IBM 360/65. He also used an 
IBM 370 computer, which is between three and five years old, in doing this 
work. These computers do not have either the speed or the features of the 
most advanced machines available today. But such advanced machines were 
not made available for this kind of work. Whatever computers used, the 
methods applied are quite reliable, so the outcome is not in question 

Before I start explaining the method, it is perhaps a good idea to review 
what was already done. 
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When Kummer proved his main theorem in 1847, and subsequently 
recognized that 37 is the smallest irregular prime, FLT was proved for 
p < 37. In his 1857 paper, Kummer proved a theorem which assured that 
FLT holds for a certain class of irregular primes (see Lecture VII). Applying 
his criterion to the irregular primes p = 37, 59, 67 less than 100, he concluded 
that FLT holds for all p < 100. However, in 1920 Vandiver noted that 
Kummer’s proofs of 1857 contained various gaps. Vandiver succeeded in 
1922, 1926 in repairing these imperfections guaranteeing already by 1926 
that FLT holds for p < 157. 

The prime 157, the smallest prime p with p? dividing the first factor of the 
class number h*, was not covered by Kummer criterion. 

In 1929, Vandiver published a long paper where he gave various new 
criteria. Let { be a primitive pth root of 1, K = Q(¢), and let A be the ring of 


integers of K. 


(1A) Assume: 
I. The second factor h* of the class number of K is not a multiple of p. 
II. None of the Bernoulli numbers Boy, (n = 1,2,...(p — 3)/2) is a multiple 
of pi. 
Then the second case of FLT is true for the exponent p. 


The above criterion is not easily applicable because the computation of 


the second factor h* is so tricky. 
The next criterion is more practical: 


(1B) Assume: 


Ill. There exists one and only one index 2s, 2 < 2s < p — 3, such that p 
divides Bas- 

IV. With the above index 2s, p? does not divide B,,,. Then FLT holds for 
Then FLT holds for the exponent p. 


All irregular primes p < 211, except 157, are covered by the above criterion. 
The third of Vandiver’s theorems was: 


(1C) If p = 1 (mod 4) and if p X B,, for every odd index s,2 < 2s < p — 3, 
then FLT holds for the exponent p. 


This criterion is not satisfied by 157, because 157 divides Be, and Bio. 
It is only his fourth theorem which applies to the prime 157. I introduce the 
following notation. Let a,, a2,..., a, be the indices such 1 < a; < (p — 3)/2 
and p|B,,,. As in Lecture VII, let g be a primitive root modulo p and let 
o be the automorphism of K = OU) such that o(¢) = C7, Let 
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and 
(p— 3)/2 i ! 
H. = TI a6 *"*) 
i=0 


fora = 1, 2,...,(p — 3)/2. 


(1D) Assume: 


V. There exists a prime I, l = 1 (mod p), | < p? — p, such that ifl<a< 
(p = 3)/2, p|B,.,, then the unit d. is not congruent to the pth power of 
an integer of K = Q(£), modulo L, where L is a prime ideal dividing Al. 


Then FLT holds for the exponent p. 


Using this theorem, Vandiver proved, in the same article, the more 
practical criterion: 


(1E) Assume: 


VI. There exists a prime I, Isi (modp), 1#1 (mod p°) such that the 
congruence 


X? + Y? + Z’ = 0 (mod!) 
has only the trivial solution. 


VII. With the above prime l, if 1 < a; < (p — 3)/2, P| Bza» then 


ave 
L H 


where L is a prime ideal of K = OU) dividing Al and {—} denotes the 
pth power residue symbol. 


Then FLT holds for the exponent p. 


Finally this is the first of the criteria which may be applied to p = 157 
Vandiver chose l = 1571, the primitive root g = 139 modulo 157, a, = 31, 
a, = 55 and ind(@3,) = 150 (mod 157), ind(@,,;) = 39 (mod 157). 

Vandiver also concluded that h, is not a multiple of p for every p < 211 

In 1930, with the same methods, Vandiver proved FLT for every prime 
P, 211 < p < 269. This was extended in 1931 up to 307 and again in 1937 
up to 617 by Vandiver and his assistants. At this point the computations 
became much too laborious for desk calculators. 

In an important paper of 1954, Lehmer, Lehmer, and Vandiver intro- 
duced a new criterion more appropriate for computations 

I have already mentioned, in Lecture VI, how Stafford and Vandiver gave 
practical computational methods to decide whether a given prime is irre Ka 
this remains after Kummer the only case which requires further SE i 
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The following lemma is basic: 


Lemma 1.1. Let | be a prime, l= kp + 1< p? — p. Let t be a natural 

number such that t* # 1 (mod l). For a > 1, let 
P72 4 1 GO Mee 
d= A jP-™* and Qa = am IL e&-i ; 
feat i=1 

With the notation already introduced, the unit 0, is congruent to the pth 
power of an integer in K = Od), modulo a prime ideal L dividing Al, if and 
only if OX = 1 (mod !). 

Using this lemma, Lehmer, Lehmer, and Vandiver proved the following 
criterion: 

(IF) Assume that p is an irregular prime, let a;, 1 < a; S (p — 3)/2, be the 
indices such that p|B,.,. With above notations, if 2* £ 1 (mod!) and Qk #1 
(mod!) for all above indices a,, then FLT holds for the exponent p. 


Using the SWAC calculating machine, the above authors proved that 
FLT holds for every prime exponent p < 2003. Soon after, Vandiver (1954) 


extended this to 2521. 
This only began a long series of computations by various mathematicians: 


Selfridge, Nicol, and Vandiver, in 1955—up to 4001, 
Selfridge and Pollack, in 1967—up to 25000, 
Kobelev, in 1970—up to 5500, 

Johnson, in 1975—up to 30000, 

Wagstaff, in 1975—up to 58150, 

Wagstaff, in 1976—up to 100000, 

Wagstaff, in 1977—up to 125000. 


Much valuable experimental information about irregular primes has 
been obtained along with these computations. This I have described in 


Lecture VI. 
In 1956, Inkeri extended the above criterion so as to guarantee that no 
solutions exist in integers of the real cyclotomic field. However, the proof 


is based on a result of Morishima (1935), which was criticized in Gunderson’s 
thesis (1948). 


3 Euler Numbers and Fermat’s Theorem 


The Bernoulli numbers seem like cousins to other numbers, considered 
first by Euler in the series development of the secant function: 
E E 

ee E EH (convergent for |x| < z), (2.1) 


E, 5 
secx = 1—37% Pa e 





| 
| 
) 
j 
! 
| 
r 
| 
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The secant coefficients were lat 
r er called Euler numbers by Scherk 
WE have been extensively studied, just like the Bernoulli EE en 
or example, they satisfy the basic recurrence relation: l 


2n 2 
SEN E S wat 
S Sa cl, roi, JEn-2+1=0. (2.2) 


Thus, 
E,=-1, Eis, Es = —61, 
E2 = 2702765, 
It is also customary to write 


E=1, 


Eg = 1385, E46 = — 50521, 
E,4 = ~—199360981, etc. 


Ban =0 forn> 0. 


Ge € a SE that the Euler numbers are integers. Moreover it may be 
at they are, in fact, odd integers, with alternati i 

d th: ; , ting s i 

g is So 1, 5 (a fact which is irrelevant Sc H 
More usefu is to establish their relationship wi i 

This was indicated by Scherk (1825): GE 


k r ; 

d 2k 27427! — 1)B,. 

E,,=1- 2e = Ba 
i A (a e ) 2i 


-_! eK. 
rae age zi Jaa" — oea (23) 


On the other hand, 


Zb (2k -1 
Soll D (EE (2.4) 


i=1 


If I have to say only a few more 
a fey words about Euler numbers, then Ichoo 
a at the aires which is analogous to the relation between the Bernoulli 
ers an the values ZO) of the Riemann zeta-function. This formula 
also discovered by Euler, is the following: l 


ire) 1 Leet EA k 
1)" _ ) 
2,‘ (2n + 1k! — oa! Ex (2.5) 





I will pass in silence over all th i 
€ numerous identities, congruence 
Se an of Euler numbers and I will just say a ee de 
nomials: they are defined and studied in a wa 
i j y analogous to Be i 
nn The classical books by Saalschütz (1893), Nielsen ee 
ae (1923) contain much interesting information on 
t is not all surprizing that the connecti ia 
all izing tion, via Kummer’ 
nn the primes dividing certain Bernoulli numbers End the ie 
ermat’s theorem, would suggest a similar theorem using the Euler numbers 


A prime p is called E-regular if ee 
Ee g p does not divide the Euler numbers 
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In 1940, Vandiver proved: 


(2A) If p does not divide E, -— then the first case of Fermat’s theorem 
holds for the exponent p. 


Thus, if p is E-regular, the above conclusion is still true. 
The basic fact used in the proof was a congruence obtained by E. Lehmer 
in 1938: 


[p/4] 1 [p/4] 
Yas} jel 1)P -124E -3 (mod p). (2.6) 


Zi j=l 


In his paper of 1954 about irregular primes, Carlitz also showed: 
(2B) There exist infinitely many E-irregular primes p, i.e., p|E2E4 ses Ejda: 


As in the case of regular primes, it is not known whether there are in- 
finitely many E-regular primes. 

The distribution of E-irregular primes was the object of two recent papers, 
by Ernvall (1975) and Ernvall and Metsänkylä (1978). I quote the following 
results. The first one sharpens Carlitz’s theorem. 


(2C) There exist infinitely many E-irregular primes p such that p # +1 
(mod 8). 


Let iiz(p) denote the E-irregularity index of the prime p; that is, the 
number of integers 2n, 2 < 2n < p — 3, such that p divides E,,. Ernvall and 
Metsänkylä computed all E-irregular primes p < 10* and found examples 
of primes p with iig(p) = 1, 2, 3, 4, 5 in the range in question. 

Let x be a positive real number, let 


n(x) = number of primes p, p < x 

n(x) = number of E-irregular primes p, p < x 

Tpe(X) = number of primes p, p < x, which are both irregular and E- 
irregular. 

Under the heuristic assumption that the Euler numbers are randomly 
distributed modulo p, it follows that 


. T(x) , 1 
Mae 
BE(X) 2 


im =1-—-+-. 
x>% MX) Je e 
The observed and predicted values of these ratios agree pretty well. 


Gut derived, in 1950, Euler number criteria, analogous to those of 
Kummer for the first case, when the exponent is equal to 2p, p an odd prime. 
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He showed: 


. (2D) If there exist x, y, z such that pX xyz and x?” + y?? = z?P, then p 
divides E,- 3, E,- 5; Ep-7, E,-9, E,- 11- 


These results, involving Euler numbers, should only be viewed as an 
alternate way of expressing equivalent facts involving Bernoulli numbers 
But joined together with other available information, they might turn out 
to be of some help. 


3. The First Case Is True for Infinitely Many 
Pairwise Relatively Prime Exponents 


Up to now, it has not been possible to show that the first case of Fermat’s 
theorem is true for infinitely many distinct prime exponents. The situation. 
however, is different for pairwise relatively prime exponents. Indeed this 
was established by Maillet in 1897: 


(3A) Let p > 3, let h = p*k, with u > 0, pk, be the class number of the 
cyclotomic field K = QY) (P = 1,¢ #1). Then if n> u + 1 and a, B, y are 
nonzero integers in K satisfying af + 8?" + y?" = 0, it follows that à = 1 — ¢ 
divides a, B or y. 


In particular, for n > u + 1 the first case of Fermat’s theorem holds for 
the exponent p”. In 1972, Hellegouarch gave a shorter proof for this theorem. 
In fact, his result applies for n > t + 1, where p' is the exponent of the p- 
primary component of the class group of K (note that u > t, so this is an 
improvement over Maillet’s result). 

As a consequence, for every prime p the first case holds for some exponent 
of the form p” and therefore, the first case is true for infinitely many pair- 
wise relatively prime exponents. 

It is worthwhile saying more about the smallest exponent M(p) such that 
the first case is true for every p”, n > M(p). Hellegouarch treated this question 
by generalizing the theorems of Furtwangler (see Lecture IX) for prime- 
Power exponents. As I have previously said, this had been achieved earlier 
by Moriya. With this method Hellegouarch proved, among other things 
the following generalizations of the theorems of Wieferich and Mirimanoff 


(3B) If p 7 3, n= 1 and the order of 2 modulo p is smaller than the order 
of 2 modulo p°", or the order of 3 modulo p is smaller than the order of 3 modulo 
p™, then the-first case holds for the exponent p". 


In other words, if the first case fails for the exponent p" = 
d th 2° 1 = 
(mod p?") and also 3?" = 1 (mod p°”). s SE 
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These congruences are stronger than the congruence 2?" 1 = į (mod p"*') 


obtained by Inkeri as indicated in Lecture IX. 

Thus the constant M(p) is at most equal to the smallest integer n such 
that o(2 mod p) < o(2 mod p°”) or o(3 mod p) < 0(3 mod p°”). 

Another result on the value of M(p) has been published recently by 
Washington (1977). Following the method of Eichler and writing the first 
factor of the class number as h* = p"* k*,where p Y k and u* > 0, then: 


(3C) Let p> 3; nz max{1,u* — Jp + 3} and x, y, z are nonzero in- 
tegers such that x?" + yP" + 2?" = 0, then p divides xyz. 


In some instances this result of Washington is better than Maillet’s and 
Hellegouarch’s. If p = 3511, then by Maillet’s theorem M(p) < 3, Hellegou- 
arch’s theorem gives M(p) < 2, while Washington’s theorem gives M(p) = 1. 

For the exponents of the form 2", Bohnitek proved the following theorem 


in 1912: 


(3D) If n > 3, then there do not exist integers a, bB, y, different from 0, in 
the cyclotomic field OU (where Cin = 1, ITT #1), such that 


ai! + SE + jee =Q. 


For example, if n = 3, this theorem says more than Theorem 169 of 
Hilbert’s Zahlbericht. Indeed it asserts that X* + Y* + Z* = 0 has only the 
trivial solution in Q(£,), and not merely in Q(i). 

More recently, in 1964, Kapferer, quite unaware of Maillet’s theorem, 
proved in a rather cumbersome way the existence of infinitely many pairwise 
relatively prime exponents for which the first case holds. His basic result is 


the following: 


(BE) Let a> 2,n> 1, e = 2 be integers such that 24 e; 3X e and a" — 1 = 
ef, where ged(e,f) = 1. If x, y, z are nonzero pairwise relatively prime integers 
such that x° + y° + 2° = 0 and a|x, then e| x. 


The above result may be applied with a = 2. According to a theorem of 
Bang-Zsigmondy-Birkhoff and Vandiver (1886, 1892, 1904), each integer 
2" — 1 (when n > 6) has a prime factor pẹ such that p, does not divide 2” — 1 
for every m < n. Let 2"—1= Bil where Pay f and e, = pi”. Then, by 
Kapferer’s theorem, the first case holds for all exponents e,. 

If it can be shown that there are infinitely many indices n such that r, = 1, 
then the first case would hold for infinitely many prime exponents pn- 

Odlyzko called my attention to a recent paper by Powell, where there is 
a very elementary proof of the following result: 


(3F) There exist infinitely many relatively prime exponents n such that the 
equation X" + Y" = Z” has no solution in nonzero integers x, y, Z such that 


gcd(n,xyz) = 1, that is, the first case holds for n. 





| 
| 
| 
| 
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As a matter of fact, if p i i 
SE ‚Ep ıs any prime and n = p(p — 1)/2, then the first 
The result (3F) is weaker than 
what was proved earlier, b i 
means by Ankeny (1952) and Ankeny and Erdös (1954). Namely. RERE 


(3G) The density of the set of ex i 
KE of exponents for which the first case is false 


4. Connections between Elliptic Curves and 
Fermat’s Theorem 


Let K be an algebraic number field ( i 
ebr of finite degree over 

ee curve, with a fixed point 0, both a e 

e the abelian group of K-rational points of E The 

a d celebrated th 

= ee (1922) SE generalized by Weil (1928), states that the Weeer 

nitely generated. So th isti i 
ee e subgroup F, consisting of the points of finite 
we E if K = Q and P = (x,y) with x, ye Q is a point of finite 
pages curve Y“ = X* + aX +b, now in Weierstrass form (abez 

+ 2107 # 0, then x, ye Zand y = 0 or y? divides 4a? +275. 

more is known and may be found in Cassels’ 

l i s excellent su 
ais (1966). I single out the following fact: 7 is either cyclic or the Sais 
of a cyclic group with the group of two elements, 

2 Kess Se GE Top “folklore” conjecture, that there is a uniform 
e orders of all points in ¥ i 
EE p A in F, which depends only on the field of 


In 1969, Manin proved the local analogue of the folklore conjecture: 


eg 2 e algebraic number Geld K and every prime p, there exists a 
= M(p,K) > 0 such that for every (nonsingular) elliptic curve o. 


defined over K, th ints i 
= ‚the orders of points in €, of order a power of p, are bounded 


However, no upper bound for M i i 
‚no | is effectively determined. 
In 1971, Demjanenko published a “proof” of the following result: 


(4B) For every algebraic number field K there exists a positive constant 


M = M(K) such that for an ipti 
i y elliptic curve €, d 
every point of finite order is at most M. SE 


However, Cassels’ review in 1972 should be taken into account: 


The author purports to prove the | i 
r pi ong-standing conj 
Mordell-Weil rank of an elliptic curve (=abelian ger a 
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over an algebraic number field is bounded by a constant depending only on 
the field. Unfortunately, the exposition is so obscure that the reviewer has 
yet to meet someone who would vouch for the validity of the proof; on the 
other hand, he has yet to be shown a mistake that unambiguously and ir- 
retrievably vitiates the argument... . 


So, whether or not the above statement of Demjanenko is true, since 
M(K) is not effective, the question still remains open: Which are the possible 
orders of the points of finite order of an elliptic curve over K? 

At this point, I would like to mention the work of Ogg (1971) and his 
conjecture. Ogg associated with every n>1 a curve #,, defined over Q, 
with the following property: For each couple (@,P), where @ is an elliptic 
curve defined over K, and P is a K-rational point of @ of order n, there 
corresponds a K-rational point of X,. Moreover, “almost” all points of 
X „are so obtained. 

The genus g, of X „ is computed to be: 


— 1 SU) 8 1). e dl 
en) 


pin 





where ọ denotes Euler’s totient function and t(d) = gcd(d,n/d). Thus g, = 0 
if and only if n < 10, n = 12. Also g, = 1 if and only if n = 11, 14, 15. 
Ogg’s conjecture was the following: If g, > 1, no elliptic curve € defined 
over K has a rational point of order n. 
A recent paper by Mazur (1978) settles the problem, confirming 
Demjanenko’s theorem, over the field (see Serre, 1977, 1978): 


(4C) The possible orders of points of finite order of an elliptic curve over 
Q aren=1,...,10 and n = 12. 


This confirms Ogg’s conjecture. The question remains open over general 
algebraic number fields. 

Since these lectures focus on Fermat’s theorem, I wish to report on 
Hellegouarch’s work, and its surprising connection with Fermat’s equation. 
Among his theorems, I mention: 


(4D) There exists an integer Ny > 17 and an integer N > 4 (effectively 
computable) such that if p is a prime, p = No, if h > 2 and if @ is any (non- 
singular) elliptic curve over Q with a rational point P of order p, then for 
every u=1,2,..., t the equation 


Te? = 0 


Me 


i=0 


has at least (p“ — 1)/2 distinct solutions in Q (not counting the trivial solution 
(0,0, . . . ,0)). 








5. Iwasawa’s Theory 209 


For elliptic curves having a rational point of order 2, the above results 
may be sharpened as follows: 


(4E) If p> 3 (respectively p = 3), and if the elliptic curve @ over Q has a 
rational point of order 2p' with t > 1 (respectively t > 2), then foru=1,2,...,t 
(respectively u = 2, . . . H the equations 


TR“ 4 TE“ 4+ TR“ H 
and 
TR“ + TE“ 4+ 2T8"=0 


have g least (pi — 1)/2 distinct solutions in Z [not counting the trivial solution 
(0,0,0)]. 


The above results, which were obtained before Mazur’s, may be viewed 
as giving information about the solutions of Fermat’s equation and analo- 
gous equations. 

On the other hand, without the newer results, they imply already the 
nonexistence of a rational point of order 2 x 33 = 54, as well as one of order 
2? x 3? = 36. But they wouldn’t be strong enough to rule out the existence 
of a point of order 18. 

Hellegouarch also showed, with similar methods, that if p > 3, there are 
no points of @ (defined over Q) of order 2p”. But again, this has been super- 
seded by the recent more powerful result of Mazur. 


5. Iwasawa’s Theory 


At this point, I’m tempted to present some material which is very attractive 
and goes deep into the study of cyclotomy. I don’t know whether this theory 
will have any direct connection with Fermat’s theorem. I consulted Iwasawa 
on this matter, but he wasn’t sure either. 

The idea is simply to take, for a given prime p, the tower of cyclotomic 
extensions K, = (UL where Con is a primitive p"**th root of 1 and to 
examine these fields, together with some associated Galois groups and their 
ideal class groups. Since p is prominent, the p-adic valuation plays an 
important role. Moreover, it is only natural to consider the p-primary 
components of the ideal class groups. Passing to the limit, the description 
becomes smoother and more natural. There will be certain numerical 
invariants attached to the whole system, which ultimately depend only on 
the given prime and reflect its special properties. 

I may trace two classical theorems as roots of this theory. First, let p = 2. 
In 1886, Weber proved (see also his Algebra, volume II, 1899): 


(SA) For p = 2, and every n 2 1, the class number of UG zl divides the 
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class number of Q(Can+:) and the quotient is odd. In particular, for every 
n > 1, the class number of OU -—) is odd. 


In 1911, Furtwangler extended Weber’s theorem for the case of odd 
primes: 


(5B) Let p be an odd prime, let n > 1. The class number of K, is a multiple 
of p if and only if the class number of Ko is a multiple of p. 


In other words, if p is a regular prime, then p does not divide h(K,) for 
every n> 1. However, if p is irregular, let e, < 1 satisfy pr||h(K,). How 
does e, behave with respect to n? This question is answered by Iwasawa’s 
theory, which, also applies to other more general situations. 

Serre’s lecture at Bourbaki’s Seminar (December 1958) is a very good 
guide to this theory. As usual, he is crystal clear, and I can’t do better than 
to follow his lead. However, due to lack of space, D not enter into too many 
details. 

Le Qc Koc Kca K, °°: be the tower of cyclotomic fields, 
with K, = Q(C,.+1), and Ko = ico Ke 

Let G, = Gal(K,|Q), so G, ~(Z/p"**)*, the multiplicative group of 
invertible residue classes modulo p"*!. Since p #2, G, is a cyclic group; 
it has order p"(p — 1). Let T, = Gal(K,|Ko), so T, is a cyclic group of order 
p". The Galois group Tœ = Gal(K,, |Ko) is the inverse limit of the groups 
I, (relative to the natural restriction of automorphisms). l is a compact 
totally disconnected group, which is in fact isomorphic (both algebraically 
and topologically) to the additive group of p-adic integers: I. = Zp As 
such, P. is a (topologically) cyclic group. 

Another aspect of the situation concerns the class group. Let 6(K,) 
denote the class group of K,,; this is an abelian group of order h(K,). Let 
ZAK, be its p-primary component. If h(K,) = p°"k,, with pXk, then 
EKn) has order p*. The group P, acts on the class group @(K,) and also 
on @,(K,). 

For every n > 1, taking norms of ideals, gives a homomorphism from 
€(Kn+1) into @,(K,), which is obviously compatible with the operations 
of the Galois groups I,.1, Ta. Let €, be the inverse limit of the abelian 
p-groups @,(K,,), with respect to the norm homomorphisms. 

Since I’, operates on @,(K,), the group algebra over the ring of p-adic 
integers, A, = ZA, also operates on @,(K,), that is, @,(K,) is a module 
over A, If A = lim A, then @, = lim @,(K,) is also an A-module. The first 
structure theorem asserts: 


(5C) For every odd prime p, there exist integers A, u, v and no > Q such 
that if n 2 no, then 
e, = An + pp" +y. 


The integers A, u, v depend only on the prime p. 


6. The Fermat Function Field 211 


The proofs of the above theorems of Iwasawa (1959) essentially use class 
field theory and I refer to Serre’s “exposé”. 

if u # 0, the growth of e, is very rapid, so it is interesting to know whether 
this may happen. Iwasawa has conjectured that for every p > 3, the cor- 
responding u, = 0. 

In successive papers, Iwasawa and Sims (1966) showed that u, = 0 for 
all primes p- < 4001. In 1973, Johnson established the same for S < 8000, 
in 1976, he went to 30000 and in 1977, Wagstaff pushed it further 6 
p < 125000. Some theoretical work by Metsänkylä (1974) gave the estimate 
Hp < (p — 1)/2, improving a previous bound of Iwasawa. Finally, in a 
1978 paper, Ferrero and Washington proved that u,=0. Actually 
the same result holds also when the ground field is any abelian extension 
of finite degree of the rational field. 

There is a similar theory concerning the growth -pri 
of the first factor of the class GE of a = a ae ae 
developed by Iwasawa, using p-adic L-functions (1972). 

The theory may be generalized for certain ground fields different from Q 
and some other types of extensions, called I'-extensions. 

There is now a flurry of research in this area. However, as yet, no explicit 
connection has been made with Fermat’s last theorem. 


6. The Fermat Function Field 


Fermat’s equation X” + Y" + z" = 0 (for n > 3) defines a curve F,, in the 
projective plane over the field Ö of all algebraic numbers. Hasse was the 


. first to investigate the rational points of F,, in the light of the theory of 


algebraic functions. 

If P is a point of F, let its coordinates be P = (x(P),y(P),z(P)) with 
xD). y(P), z(P) e Q (they are defined up to a a en Let 
Fa be the affine portion of the curve F,, consisting of all points P such that 
z(P) # 0. Let &, ņ be the functions from F? to Ö defined by &P) = x(P)/z(P) 
n(P) = y(P)/z(P). Let { be a primitive 2nth root of 1, Ky = Ol), and K = 
Ko(é,n). This is the quotient field of the ring K,[T,U] (where: T, U are 
indeterminates) modulo the prime ideal generated by T" + U" EL K is 
called the function field of the affine curve F °. E is transcendental over K 
and n= —(1 + €"), so it is algebraic over KAELL Thus K is an aigebtaic 
function field of one variable, having field of constants equal to Kg. This 
last statement reflects the fact that T” + U" + 1 is an absolutely fireduicible 
polynomial over Ko. 

Mr The ES theory of algebraic function fields of one variable, when 
€ constant field is not necessarily the field 
developed by Hasse and Chevalley, ore a ee 
The arithmetic Riemann surface S of K is the collecti i 
divisors pf of K|Ko. These may be viewed as symbols Me ae 
bijectively to the valuations of K, which are trivial on Ko. I write w, for the 
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valuation corresponding to the divisor %. Let ©, be the valuation ring of 
W, that is, the ring of all elements ae K such that w,(a) > 0. The ring ©, 
has a unique maximal ideal, Æ, = {« € K |w (a) > 0}. The quotient field 
Kp = Op My is an algebraic extension of finite degree of Ky. This degree 
fa = [Ky:Ko] is called the degree of w,, or of the divisor 4. The residue 
class of « eg, modulo 4, is written «(f), and it is called the value of « at 
the divisor £. 

It may be shown that the prime divisors of K|K, correspond to the 
points P of the affine Fermat curve 7 0. Those of degree 1 correspond to 
the points P with coordinates x(P)/z(P), y(P)/z(P) in Ko. More generally, 
each prime divisor % corresponds to a point P with coordinates in K4. 

The free abelian group 2 generated by all the prime divisors is called 
the group of divisors. So the divisors are the formal products a = IL, ag 
with a(%)e Z and a(Z) = 0 except for finitely many prime divisors %e ©. 
The degree of a is, by definition, deg(a) = 3. a(f) deg(/). 

The unit divisor is e = | |, 4 with e(%) = 0 for every 4. The divisor 
a is called integral if a(%) > 0 for every 4. The divisor b divides the divisor 
a if there exists an integral divisor c such that be = a. With obvious notation, 
this amounts to saying b(%) < a(f) for every %. 

Any divisors a,,...,4, have a unique greatest common divisor d = 
gced(a,,...,4,), with d(%) = min{a,(f),...,a,(f)} for every 4. They also 
have a unique least common multiple. If ged(a,,...,a,) = e (unit divisor) 
then the (integral) divisors a,,..., a, are said to be relatively prime. 

For each element « € K, « # 0, it is true that w,(a) = 0 except for finitely 
many prime divisors 4. Then div(a) =[], asi is a divisor, called the 
divisor of a. If %1,- - - , 4m (m = 0) are the prime divisors such that w,,(a) > 0 
and if Zenn. n(n = m) are those such that w a(t) < 0, then div(a) = a/b, 
where a= p}... ptm, b= fy ischt: "rn, Thus a, b are 
integral divisors, gcd(a,b) = e, a is called the divisor of zeroes, b the divisor 
of poles of «. It is true that a, b have the same degree, i.e., deg(div(«)) = 0. 

In particular, considering ¢, n € K, I may write div(é) = a/c, div(n) = b/c 
with a, b, c integral divisors and gcd(a,b,c) = e. Moreover, deg(c) = n because 
the field extension K|Ko(é) has degree n. So deg(b) = n and I may write 
b = ër" ` fn, where the %; are prime divisors (a priori not necessarily 
distinct). From the relation E" + 4" + 1=0 it follows that div(é" + 1) = 
rie = (45/0) x ++ x (file. 

On the other hand, from 


E + 1 =(E— Lg MAC 


where D = —1 fori=1,2,...,n, writing div(& — ¢,) = a;/c gives deg(a,) = 
deg()=n, and ou: an = Zi fa After renumbering, a; = fī, SO 
deg(%,) = 1 and hence %1,- . . fn are distinct. 

The prime divisors of degree 1 correspond to the points P; with coor- 
dinates (¢,,0,1) belonging to Ky = Q(¢). In the same way, other trivial 
solutions P, with coordinates (0,f;,1) correspond to the prime divisors 
dividing a. 
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Searching for nontrivial solutions of Fermat’s equation in Kọ corre- 
sponds to discovering other prime divisors of degree 1. The idea behind this 
approach to the problem is to make use of the vast arithmetic theory of 
algebraic functions and see whether some of its powerful theorems would 
imply or exclude nontrivial solutions. This approach has as yet produced 
no conclusive results. I want however to describe what has been achieved 
without attempting to explain all the terms and facts used. 

The differential d(1/€) = — d¢/¢? has the divisor 


ol «()) _ diff(K | Ko) 
č a? 


The numerator is the different of the extension K | KA), Because y = 
3 —(1 + č"), the prime divisors %,, with b = f; --- fn, are the only ramified 
primes; so diff(K | Ko(é)) = b" t, 

The genus g is computed by the formula: 2g — 2 = degree of div(d(1/é)). 
So 2g — 2 = (n — 1)deg(b) — 2 deg(a) = n(n — 3) and therefore 


(n — Um — 2) 
> 


Thus ifn = 2, then g = 0, so the Fermat curve is a rational curve. Ifn = 3 
then g =1 and the associated field K is an elliptic function field. For n > 4, 
SE 2. It is known that, for any divisor a, the set Joe K |div(a)a is an integral 
divisor} is a vector space of finite dimension over Ky. This dimension is, by 
definition, called the dimension of the divisor a. 

Now, TIl discuss the Weierstrass points of the Fermat function field. 


Let d = div(d(1/£)) and let % be any prime divisor. I consider the sequence 
of integers 





len <n<'<r,s2g-1 


ae is defined as follows. From the Riemann-Roch theorem, dim(d) = g. 
et 








ol A =. = dim( Zu) =0. 


The weight of % is by definition 


WA = È -D= Y ntg + 1). 
i= i=1 


Hence W(%) = 0 if and only ifr; =ifori=1,...,g. 
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The prime divisor % is called a Weierstrass point if W(f) > 0. The theory 
tells that W(%) < (g — 1)g when g > 1, and that there are only finitely many, 
say N, Weierstrass points. The sum of their weights is W = Z, WA = 
(g — glg + 1). 

If g>1, then gt 1<N<(g-1)g(g+ 1). In case the exponent in 
Fermat’s equation is n > 3, so g > 1, then the 3n trivial solutions of X” + 
Y” + Z" = 0 correspond to Weierstrass points, all with the same weight 


gin — Am 21 A _ (g-gn +4 
12 6n 








so the sum of these weights is Wo = $(g — 1)g(n + 4). 

If n=4, g=3, then W=W and the above are therefore all the 
Weierstrass points. 

However, if n > 4 there are many more Weierstrass points, the sum of 
their weights being equal to Wy = 4(g — 1)g(2g — n — 2). The correspond- 
ing points of Fermat’s curve have to be distinguished points (with coordi- 
nates algebraic, but not necessarily in OCI, Therefore, it should be of 
interest to determine the Weierstrass points. 


7. Mordell’s Conjecture 


Let K be an algebraic number field and let € be a nonsingular projective 
curve over K, with genus g > 2. In 1922, Mordell gave his now famous 
conjecture: The set of points of € which are K-rational (that is, have co- 
ordinates in K) is necessarily finite. 

For example, if this turns out to be true, taking K = Q and the Fermat 
curve ¥,:X" + Y" + Z” = 0, with n > 4, then there would be only finitely 
many solutions of Fermat’s equation in rational numbers, or equivalently, 
in integers. This is not as strong as what Fermat’s last theorem asserts. 
Still less so, if no bound on the number or size of the solutions is obtained. 
At any rate, for the moment a proof of Mordell’s conjecture seems, with 
good reason, very remote. 

I will report here on some work done recently to connect Mordell’s 
conjecture and Fermat’s equation. 

In 1972, Hellegouarch noted that if f(X,Y) is a polynomial with coeffi- 
cients in K and the equation f(X,Y) = 0 has only finitely many solutions 
Ge, pl with x; y;e K then, for any integer m > 2, there exists an integer 
ho > 1 such that for every h > ho f(X"",Y”") = 0 has only the trivial solu- 
tion in K. This remark led to the formulation of the following weaker Mordell 
conjecture: 

Let K be an algebraic number field, let p > 2 bea prime, f(X,Y) € K [X,Y]. 
If there exists h > 1 such that the curve defined by f(X?”,Y?") = 0 has genus 
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g = 2, then there exists k > h such that f(X"",Y”) = 0 has only the trivial 
solution in K. 

For example, if this conjecture is true, then it would give at once a weaker 
form of Maillet’s theorem of 1897. Namely, let K = Q, let f(X,Y) = X + 
Y — 1 and p>5, h = 1. Then the curve of equation X? + Y? — 1 = 0 has 
genus g = (p _ 1)(p — 2)/2 > 6. So there would exist k > 1 such that the 
equation X? + Y? — 1 = 0 has only the trivial solution, as Maillet proved. 
Hellegouarch gave some more support to this weaker conjecture. 

In 1965, Mumford examined the countable set of K-rational points of 
a nonsingular projective curve @ of genus g > 2. Short of proving that there 
are only finitely many solutions, Mumford wrote the solutions in order, by 
increasing height: P,, P2, P3 --- and he was able to prove: 


(TA) There are real numbers a, b, with a > 0, such that the height of P; 
satisfies: ht(P,) > e+», 


The concept of height of a point is explained in Mumford’s paper. 
In the special case of Fermat’s curve and K = Q, this result becomes: 


(7B) Let n > 4 and let (x;,y;,z;) be an infinite set of distinct positive integral 
solutions of X" + Y" = Z", such that gced(x,y,2) = land z} < Z3 < ee, 
Then there exist real numbers a, b, a> 0, such that z; > exp{exp(ai + b)}. 


To conclude this brief excursion into Mordell’s conjecture, I would like 
to mention the work initiated by Lang and enhanced by Kubert and Rohrlich 
(1975, 1977). Kubert and Lang have already published three papers, in 
what is announced to be a long series. The papers are very sophisticated 
and I dare not explain their contents. Perhaps, a paragraph which is indica- 
tive of the tenor of the first paper should be quoted here: 


... given a projective nonsingular curve V and an affine open subset Vo, 
does there exist an unramified covering W of V, and units u, v in the cor- 
responding affine coordinate ring of W, such that u + v = 1? Whenever this 
is the case, we get a correspondence 


®, H 
\ Z N unramified 
pi H 


which reduces the study of integral points on V to integral points on [the 
Fermat curve] ®;--- . We see that the Fermat curve ©, has such a cor- 
respondence with ®,. 


Rohrlich considers the group 2” of divisors of the function field of 
Fermat’s equation which are on the points at infinity, that is, the trivial 
solutions of X” + Y” + Z" =0 (n > 3). 
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Let F” be the subgroup of principal divisors. In his thesis, Röhrlich 
computes the quotient group, arriving at the following result: 
(Z/n)?"~7 if n is odd, 
(Z/n)”""" x Z/2 if nis even. 

It is not explicitly stated where these studies will lead, and for this we 
all have to wait and see. 


D/F? = | 


8. The Logicians 


What do the logicians say about Fermat’s last theorem? They are concerned 
about the truth, the possibility of proving the theorem starting from a given 
set of axioms, and about the question of undecidability. I wish to explain 
briefly the meaning of these expressions, avoiding any technicalities. 

Fermat’s last theorem concerns a sentence of the universal type, because 
all its variables are preceded by the quantifier “for every”: “For every x, 
for every y, for every z, for every n: either xyz = 0 or x"*? + y"*3 4 z"+3» 
It is understood that this sentence is to be interpreted with x, y, z, n as natural 
numbers (including zero). If the sentence is true with this interpretation, 
then Fermat’s last theorem is true. 

The negation of this sentence is an existential sentence: “There exists x, 
there exists y, there exists z, there exists n, such that: xyz # 0 and ap"? + 
ynts = zu+3» 

The axioms of Peano’s arithmetic involve the symbol 0, the unary opera- 
tion o (successor), and the binary operations addition and multiplication. 
They are very well-known, so I need not repeat them here. Let me just 
recall that one of the axioms, the principle of finite induction, is in fact 
an infinite set of axioms, namely one axiom for each formula in the formal 
language. 

The set N of natural numbers, with the symbol 0 interpreted as the 
number “zero,” the operations +, -, as addition and multiplication of 
natural numbers, and the successor operation interpreted as the successor 
of a natural number, becomes a model for Peano arithmetic. This means 
that every axiom of Peano arithmetic is true in N, with the above 
interpretation. 

More generally, any set A, with a constant, two binary operations, one 
unary operation, such that Peano’s axioms are true in A, with an appro- 
priate interpretation, is also called a model of Peano arithmetic. 

The theory of Peano arithmetic consists of all sentences which may be 
proved (with the usual rules of inference) from the axioms of Peano arith- 
metic. Clearly, every theorem of Peano arithmetic is true in any model of 
Peano arithmetic. In other words, if a sentence may be proved “syntactically” 
(i.e. with the rules of inference) from the axioms, all its interpretations in 
any model are true, so it is “semantically” true. 
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In 1934, Skolem provided specific models of Peano’s arithmetic which 
are not isomorphic to N, but have exactly the same theorems as N. As one 
says, they are elementary equivalent to N. Such models are called non- 
standard models of N. 

A theory is consistent if, for any sentence S in the language of the theory, 
it is not possible that both S and its negation 1S be theorems. If a theory 
has a model then it is necessarily consistent. 

On the other hand, a theory is complete if, for any sentence S in the 
language of the theory, either S is a theorem or its negation 1S is a theorem. 
Thus, if a theory is complete, any sentence S in the language of the theory 
is either true in every model, or false in every model of the theory. 

One of the most fundamental theorems in logic is Gödel’s completeness 
theorem: If a theory is consistent, then it has a model. Another way of ex- 
pressing this theorem is the following: Every consistent theory may be 
extended to a consistent and complete theory. Still another way to put it, 
closer to our concern, is the following: A sentence is a theorem in a given 
theory, that is, it may be proved from the axioms if (and only if) it is true in 
every model of the theory. 

If the theory is not complete, there are sentences S which are true in some 
model and false (that is, 1S is true) in some other model. Any such sentence 
is called an undecidable sentence. Both S and 1S cannot be proved from 
the axioms. 

Considering specifically the theory of Peano arithmetic, Gödel proved 
that this theory is incomplete. And much more. In fact, roughly speaking, 
every formal extension of Peano’s arithmetic is still incomplete. So, there 
are sentences in Peano arithmetic which are undecidable. 

Is the statement F (Fermat’s last theorem) undecidable in Peano’s arith- 
metic? I show that if F is undecidable, then F is true in the model N. Indeed, 
since F is undecidable, IF is not a theorem in Peano arithmetic. Thus, 
JF is not true in some model M of Peano arithmetic. So F is true in the 
model M. But the model N has the special feature of being a prime model, 
which means that if a universal sentence S is true in some model M, then it 
is true in N. In particular, F (Fermat’s last theorem) is true in N. 

There are other interesting considerations about the logical aspect of 
Fermats last theorem. The fact that the sentence F is a theorem depends 
essentially on how rich is the given collection of axioms. In a sense which 
Pll not make precise, Shepherdson (1965) has shown that, with methods 
current at the time of Fermat, and without “auxiliary operations” (like, say, 
Legendre’s symbol, and this is explained in Shepherdson’s paper) for no 
value of n > 3 it is possible to prove the sentence 


F,,: “For every x, for every y, for every z either xyz = 0 or x" + y" # 2”.” 


The recent solution of Hilbert’s tenth problem has also some bearing 
on Fermat’s last theorem. This is explained well in the paper of Davis, 
Matijasevi, and Julia Robinson (1976). The new idea is the possibility of 
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transforming the proof of Fermat’s last theorem into the verification that 
a polynomial equation has no solution in nonnegative integers. 

The method is based on the following fact. There is a polynomial 
P(X,Y,Z,W,, ...,W,), with coefficients in Z, such that the equation Y” = X 
has solution in natural numbers X = a, Y =b, Z =c if and only if the 
Diophantine equation P(a,b,c,W,,...,W,)=0 has a solution in natural 
numbers W = w,,..., W, = Wx. 

Thus, Fermat’s theorem is true if and only if the system of equations 


P(A,X +1,N +3,U,,...,U,) =0 
P(B, Y +1,N +3,V,,....%,) =0 
P(A + B,Z,N +3,W,,...,W) =0 


with indeterminates A, B, X, Y, Z, N, U,,..., DU... Vee Wi.» Wo 
has no solution in natural numbers a, b, x, y, Z, A, U4, - - -, Wg- Considering 
the square of the polynomial P, this is equivalent to the condition that 
the polynomial equation 


PXA,X+1,N+3,U,,...,U)+ P{B,Y+1,N +3,V,...,W) 
+ P(A + B,Z,N +3,W,,...,W)=0 


has no solution in natural numbers. 

At present, it is possible to construct explicitly a polynomial like the 
polynomial P, with less than 12 indeterminates. Can we hope for more 
with this method? 

Recently, I came across a still unpublished manuscript by Zinoviev, 
which was circulated in the Symposium on Modern Logic, organized by 
the Istituto dell’Enciclopedia Italiana, in 1977, in Roma. The last sentence 
of that text is: “It means that GFT (the Great Fermat Theorem) is 
unprovable.” 

Having been unable to understand this paper, I consulted Professor 
Kreisel, who was present at that particular meeting and who replied to 
me (1978): “In my considered opinion, Zinoviev’s paper on Fermat’s con- 
jecture is worthless.” 
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LECTURE XI 


Estimates 


In the preceding lectures, I have always mentioned the efforts of mathe- 
maticians to prove Fermat’s last theorem. Their attitude was, in the main, 
the following: 

Fermat's last theorem is true; let’s try to find a proof. Any progress 
towards the complete proof will be worthwhile, whether it deals only with 
the first case, or if only covers certain exponents. 

With this rationale, more and more complicated methods have been 
brought into the battle with, I must admit, only relatively minor success. 
This seemingly hopeless struggle prompted some mathematicians to begin 
to doubt the truth of the theorem. But their work might even turn out to 
be useful in proving the theorem. 

TU begin evaluating, with elementary methods, the size of the smallest 
possible solution, for any given exponent p. Some more sophisticated estimates 
for the first case are based on the criteria involving the Fermat quotients. 

Recently, the methods of diophantine approximation and linear forms 
of logarithms have provided a new tool to attack the problem. It has still 
to be fully exploited. 


1. Elementary (and Not So Elementary) Estimates 


The approach is the following. Let n be any natural number, n > 2, and let 
x, y, z be real numbers, such that 


0<x<y<z (1.1) 
and 
De er (1.2) 


226 XI Estimates 


In 1856, Griinert proved: 


(1A) If 0 <x < y< zare integers and x" + y" = 2", then x > n. 


PROOF. 


n-1 


x" = z" — y" = (z — yZ ht Byte yT?) > (2 pn 


Hence 
x" x 
Dees 
ny n 
and 
x 
yrlezeyh- 
son<x. 0O 


This shows that any counterexample to Fermat’s last theorem must 
involve large integers. For example, ifn = 101 then x > 102 and the numbers 
involved would be at least like 1021°!. As a matter of fact, it is rather easy, 
without any powerful methods, to show that any would-be solution must be 
substantially larger. 

From the above proof, it follows that 


x 1 
= N 1.3 
TEE OR (1.3) 


Hence z, y are relatively close together and therefore the size of x should 


be much smaller. 
On the other hand, Perisastri showed in 1969 that x cannot be much 


smaller than z: 
z< x’, (1.4) 


To obtain further estimates, it is convenient to introduce the positive real 
numbers r, s, t, "1, 81, fu defined as follows: If n = p is an odd prime, if 0 < 
x< y< z are real numbers and x? + y? = zP, let 











xP + P 
x+ty=P, log, z= tty, 
x+y z 
zł? — P 
z-y=r’, x =r, A = Tf], (1.5) 
zP — xP 
z=x= s, EE = si, y = 8S4. 


These relations are reminiscent of the ones obtained by Abel, as explained 
in my fourth Lecture. Now, it is not required that these numbers be integers. 
However, if x, y, z are nonzero integers and p does not divide xyz, then all 
the above numbers are integers, as Abel proved. 
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From (1.5), it follows that 


RAIL —rP + sP + ¢P rP? + SP + ¢P 
Ue ag Ce sae, Zn = “te 
and 
rP + sP — 1? 
EE (1.7) 
It is also obvious that 
0O<r<s<t. (1.8) 


A better bound for the first case is the following: 


(1B) If x, y, z are integers, not multiples of p and x? + y? = z’, then x > 6p?. 
If p divides xyz, then x > 6p. 


ProorF..I will illustrate in this proof how some of the naive results of Lecture 
IV may be put to use. There it was proved that if p / xyz, then p*rst divides 
x + y — z [see Lecture IV, (3B)]. Since 0 < r < s < t, at worst r= 1, s = 2, 
t=3,s0 6P <x+y-z<x. 

The other case is similar. o 


The search for improved lower bounds for eventual solutions of Fermat’s 
equation was the object of numerous papers. I will not narrate step by step 
all these improvements. Conceptually, these results do not throw any more 
light onto the problem. 

However, as in giving the latest news, I will mention Inkeri’s estimate, 
which is at the present time the best one known. His method depends not 
only on inequalities which would be generally true, but also on precise 
joao properties of the numbers r, s, t. Inkeri showed successively 


Oct <s <r (1.9) 
If 2 < y (for example, if x, y are integers) then t < t4. (1.10) 


, These two inequalities are quite obvious from the definitions. More work 
1s required to show that 


p-1 





t> and t—s> z (1.11) 


The next inequality is fundamental. It was first shown by James in 1934 
and rediscovered by Inkeri, who based his proof on the following lemma: 


Lemma 1.1. Let a, b be distinct positive real numbers, a + b < 1 andn > 5. 
Then E 


(1 + a” —_ b")" + (1 = a" + b"" > O + a" + b")". 
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From relations (1.2), it follows that 
(rP — SP + £7)? + (—r? + SP + tP) = (r? + S + E (1.12) 
The fundamental inequality is 
r+s>t. (1.13) 


To prove it, let a=r/t, b = s/t hence Oo<a<b<1.Ifr+s<it, then 
a+b<1 and by Lemma 1.1 this would contradict (1.12). 

If x, y, z are integers, not multiples of p, 0 < x < y < z, X? + y? = z’, then 
r, s, t have the following arithmetical property: each one of these numbers 
has a prime factor q such that q = 1 (mod p°). 

The proof of this statement is not elementary. More precisely, using 
methods described in my Lecture IV (The Naïve Approach) it is only possible 
to show that q? = 1 (mod p°). 

To conclude the proof, Furtwängler’s theorem is required: Each prime 
factor q dividing xyz satisfies the congruence g?~! = 1 (mod p°). Hence 
q = q? = 1 (mod p°). Since class field theory is needed to establish Furt- 
wangler’s theorem, the above proof is no longer elementary. 

Finally, Inkeri showed (1953): 


(1C) If p is an odd prime, 0 < x < y < z are relatively prime integers, such 
that x? + y? = zP, and pj xyz, then 


2p? + p\? 
” ( pet z) i 
log(3p) 
SKETCH OF THE PROOF. By the preceding result r = rg, s = Süd, t = t'q3, 


where r’, s’, t are natural numbers, and q,, q2, q3 are primes and q; = 
1 (mod 2p’), q, = 1 (mod 2p”), q3 = 1 (mod 2p”). By Lecture IV, (3B), 





r +s — t = rq + sq -— tg; = 0(mod p°). 
First Case. If max {r',s't'} > p?/2, then t > (p?/2)(2p? + 1) > p*. Since z — y 
> 1 and z > (x + y)/2 = 12, 
(Tt P 
xP = z — yP > (z — y)! > (5) 
so x > t?~1/2 Annie > [(2p° + p)/log(3p)]P. 


Second Case. If max {r’,s’,t'} < p?/2, then 
p? D DH H 2 
> <r+s—-t <p’. 


But p? divides r’ + s' — t' sor’ +s'=t'. Hence t — s = t — s = r (mod 2p’) 
so t — s = 2mp? + r’, where m is an integer. Using (1.11) it follows that m > 1. 
It may be shown, using (1.12) that 


r SEA 
~<&= p] ka 14 
"et NEE (1.14) 
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hence 
t—s 
t > ——. 1.15 
ae (115) 
Since 
pete Dr 


xP > (z — y)z? T! = Pz! > 





2 ` 
it follows that 


x> 





rt?~} (Gs) 2p? + 1f 2p +1 [rt 


2 OTS). ur E (2) 
p log2 


Assuming p > 1190 (otherwise there is no solution) then 


2p VF" 
2p el 2 < [log(3p)]? 


2p? + | 
> K 
ee = 


Similarly, Inkeri obtained bounds for the second case: 





(1D) If p is an odd prime, 0 < x < y < z are relatively prime integers, such 
that x’ + y? = zP and p divides xyz, then 
3p-4 


and y>4p?r-!, 


x>p 

Since the first case is true for every prime exponent p < 3 x 10° then in 
this case x would have at least 12 x 10'° digits. Similarly, in the second case, 
x would have at least 18 x 10° digits. 

Another interesting estimate was obtained by Inkeri and van der Poorten 
(in 1977); it gives the following lower bound for the difference z — x in terms 
of the exponent p: 

Z—X > Pp”, (1.16) 


2. Estimates Based on the Criteria Involving 
Fermat Quotients 


J have said in Lecture VIII that if the first case fails for the exponent p, then 
the Fermat quotients of p with bases q = 2, 3, 5,7,..., 43 (all primes at most 
43) are congruent to zero modulo p. In other words 


q? — 1 = 1 (mod p?) (2.1) 
for q prime, g < 43. 
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In 1940 and 1941, Lehmer and Lehmer devised a very ingenious method 
to considerably extend the truth of the first case. 

In his doctoral thesis (1948), written under the supervision of Rosser, 
Gunderson devised another method, along different lines from Lehmers’, 
which actually provided stronger results. I cannot enter into great detail 
about these methods, but TIl expose their general lines. 

Let p; = 2, p2 = 3,p3=5,---.Pi1 = 3L. .- , P14 = 43,... be the sequence 
of primes. For every n > 1 let P, be the set of natural numbers whose prime 
factors are at most p,. 

For every x > 1 let P,(x) be the number of elements a of P,, such that 
1 <a<-x. Similarly, let P*(x) be the number of odd integers a, such that 
l<asx,aeP,. 

For every prime p let W, be the set of natural numbers a> 1 such that 
ar”! = 1 (mod p°), and for x > 1 let W,(x) be the number of elements ae W,, 
UE EE? 

I rephrase the known criteria for the first case as follows: if the first case 
of Fermat’s theorem fails for p then P,, c W,, because W, is closed under 
multiplication. 

Rosser (1939, 1941) and Lehmer and Lehmer (1941) showed: 


Lemma 2.1. 


2. If N > 1 and Py c W, then 


2 2 2 
P DE EC, 
m Fr ep er 
Gere 
If M is a good approximation to P, ‚(p?/2) from below, the lemma gives 
a lower bound 2M + 1 < p. Hence if p < 2M + 1 then the first case holds 
for p. 
This is what Gunderson worked out. 


Let n > 1, x > 1, y > 1 and let P,(x,y) be the number of pairs (a,b) with 
gcd(a,b) = 1,a,beP,, 1<a<x,1<b<y. 


Lemma 2.2. If Py = W, and if 1 < x < p?/2, then Py(x,p?/2x) < (p — 1)/2. 


The function P,(a,b) is not simple to evaluate. Gunderson proposed to 
replace it by a function with smaller values. 
Ifn=2,x>1,y> 1, let 


2(log x)(log y) 
eo ern 2.2 
G2(x,y) (log 2)(log 3) ( ) 
and for n > 3 let 


1 y dz x dz 
gax, Y) = vc) d El + iF as | (2.3) 
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First Gunderson showed: 





P (x,y) > g(x,y) (when max {x,y} > 3). (2.4) 
Next, he gave the following expression of g,(x,y). Let n > 2 and 
ae 2 1 
"=n! (log2\llog3)---(ogp,) ES 
Then 
_ = i nf 2\(n 
i=1 = H 


With this preparation, Gunderson proved 


(2A) If N = 3, p > 3 and Py c W,, then 


2 


P-1, E E d RR (1) 


_—— — KL 
N-i N! (log 2)(log 3): - (log py) 


From the Fermat quotient criteria up to 43, it follows that the first case 
of Fermat’s theorem holds for every prime exponent p < 57 x 10°. 

To illustrate the power of Gunderson’s method, we compare with the 
recent computations of Brillhart, Tonascia, and Weinberger. As I have 
already said, they found no primes p <3 x 10° (except 1093 and 3511) 
satisfying the Wieferich congruence 28" 1 = 1 (modp?); and the two ex- 
ceptional primes did not satisfy the Mirimanoff congruence 3’~! = 1 
(mod p?). Unless the computing techniques are substantially improved, it 
will be very time-consuming to test Wieferich’s congruence for primes p up 
to 57 x 10°. 

I'll say only a few words about Lehmers’ method. If Py c Wp, then 


2 
EI 


parafa) nE} 


The idea of Lehmer and Lehmer was to construct recursively, polynomials 
SAX), fHX), with real coefficients, such that f,(2) < P,(10%), f*(2) < P*(10*) 
(for n = 1,2,...,N and 4 > 0) and such that they provide a good approxi- 
mation to P,(10*) and P*(10%). 

If <a<x,aeP,, then a = | |f- p7 (x; > 0) and to a is associated the 
point (x,,..:,x,)¢Z", with 0 < x; SE? log p; < log x. This is a one-to-one 
mapping. Therefore the determination of P,(x) (and similarly of P*(x)) is a 
special case of the following problem: " 

Given the positive real number J and any basis {o!,...,o"} of R", where 


and also 
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ol = (wi, ... ok determine the number N,(A|a!, . . . flot all lattice points 
Led <icn € Z” such that x; > 0 and S7_, xœ} < 2. 
The following is the basic recursion formula: 


N,(A|o,....@")= 3 NA — ko |o,...,0"~7), (2.7) 
Ken 


where s = [A/c]. 
Since Nlo, ...,@") cannot be easily computed, the Lehmers deter- 
mined polynomials f,(X), g,(X), of degree n, such that 


SAX) < NAO, > 50") < gl X) 


and such that the difference could be easily evaluated. The matter is rather 
technical, so I avoid any further discussion. . 
As already said, the computations were for n = 14 and gave the celebrated 


bound for the first case 
p < 253747889 


3. Thue, Roth, Siegel and Baker 


Short of proving that Fermat’s equation has only the trivial solution, a good 
substitute would be to show that (for every exponent p) the equation has at 
most finitely many solutions. 

Better still, would be to determine a number C(p)>0 such that if 
x? + y? + z? = 0, with nonzero relatively prime integers x, y, z, then 


x) lyl |z|} < C(p). 


Finally, the very best possible in this vein, would be to determine a number 
C > 0 such that if p is a prime, and if x? + y? + z? = 0, then 








max{ 


max{p,|x|Iyllzl} < C. 


Whether C would be very large, or not so large, such a result would mean 
essentially a solution of Fermats theorem. The remaining task would be the 
investigation of each prime p less than C, and with the estimates already 
known (and others as yet undiscovered) it would appear feasible to solve this 
problem. 

Alas, nothing like this is yet known. 

An early incursion along these lines is due to Turán, and later to Dénes 
and Turán. 

In 1951, with the standard methods of analytical number theory (in 
particular using the prime number theorem), Turán proved the following 
result. 

Let N > 1 be an integer, let v,(N) denote the number of triples (x,y,z) of 
integers such that ged(x,y,z) = 1, x? + y? = 2? (where p is an odd prime) and 
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1< KEES N. To prove Fermat’s theorem for the exponent p amounts to 
showing that v,(N) = 0 for every N > 1. Turän’s result, is-far from this: 
There exists a constant C > 0 such that for every N 


VN) < p(l +3 x 2UP)N2'P 


In their joint paper of 1955, Dénes and Turán first gave the following 
estimate, using elementary methods: 


vN) < p(l + 3 x 2UPyN?P, 
Using deeper analytical estimates, they obtained the better result: 


N2/P 
vaN) < C(p) (log N)? em) 


They also conjectured that 

v,(N) < CN"? 
and perhaps, for every e > 0 

v AN) < C(e)N*. 


In the above inequalities, C, C(p), C(e) denote positive real numbers. 

Turän’s conjecture follows from a much stronger result of Mumford 
[quoted in Lecture X, (7B)]. Turan’s conjecture, however, does not imply 
that Fermat’s equation for the exponent p has only finitely many solutions 
1n integers. 

The recent methods of diophantine approximation, based on the work 
of Thue, Roth, Siegel and more recently Baker, give some hope that new 
results may be in sight. Anyway, it is worth a try. 

It is not possible, in a short space, to render justice to the beautiful and 
deep theorems which I will quote. Still worse will be my omissions. 

Following an idea of Thue, Roth proved: 


(3A) Let n > 3, let 
F(X,Y) = agX" + a, X" 1Y + + ap- XY"! DEA A 


where aje Z, ag #0 and assume that the roots of F(1,X) are distinct. Let 
G(X ‚Y) # 0 have coefficients in Z, of total degree at most n — 3. Then the 
diophantine equation F(X,Y) = G(X,Y) has at most finitely many solutions 
(x,y) €Z x Z. 


The proof follows an idea of Thue and requires the famous theorem of 
Roth (1955) of approximation ofa nonrational algebraic number by rational 
numbers. 

The special case G(X,Y) = ae Z, a # 0, was proved by Thue (in 1909). 
In particular - 


(3B) If n > 3, if a, b, c are integers, a, c not zero, then the equation 
aX" + bY" = c has at most finitely many solutions in integers. 
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Taking a = 1, b = 1, and c = 7", for each z # 0 there are at most finitely 
many integers x, y such that x” + y” = z", when n is odd. This could also be 
derived using the theorems of Zsigmondy (1892) or Birkhoff and Vandiver 
(1904). 

The next result, due to Siegel (1929) has a geometric flavor. 

Let f(X,Y) be a polynomial with coefficients in Z which is absolutely 
irreducible (that is, cannot be written as a product of polynomials of smaller 
degree and coefficients from any algebraic number field). The set of points 
(x,y), with complex coordinates, such that f(x,y) = 0 is a curve. Siegel’s 
theorem states: 


(3C) If the curve defined by f(X,Y) =0 is not a rational curve (in other 
words it has genus greater than 0), then there exist only finitely many pairs of 
integers (x,y) such that f(x,y) = 0. 


In the case of genus 1, by means of an appropriate birational transfor- 
mation, the above theorem reduces to a similar one for the hyperelliptic 


equation 
aoX" + a, X"! +++ +4, = aY?, (3.1) 


where ap, ai, ---, G,€Z, ao #0, a#Q,andn>3. This special case had 
been obtained by Siegel in 1926 and in that paper Siegel mentioned the 
generalization to the curve F(X) = Y™. An explicit and direct proof was 
given by Inkeri and Hyyré in 1964, and this was extended by Schinzel and 
Tijdeman in 1976. 


(3D) If m > 2, n > 2 and max{mn} > 3, if f(X) e Z[X] has degree n and 
simple roots, and if a + 0 is an integer, then the equation f(X) = aY™ has at 
most finitely many solutions in integers. 


The more recent method of Baker concerns effective positive lower 
bounds for linear forms of logarithms. Baker considered in 1966 the linear 


form 
A=b,loga, +--:+5,loga,, (3.2) 


where n> 1, &,...,&, are any algebraic integers and b,,..., 5, are any 
integers, and log denotes the principal determination of the logarithmic 
function. 

Let d be the degree of the field Q(a,,...,%,) over Q. For every a; let 
H(a,) be its height defined as follows. Let f(X) = coX™ + en pet 
Cm be the only polynomial with coefficients in Z such that gcd(co,C1, tel = 
1 and f(a) = 0, then Hla) = Maxo <icm{|cil}- 

Let A; = max{4,H(a,)}, for i= 1, 2,...,n, numbered such that A, < 
A, << A, and let B = maxı <i<nt|bil, 4}. 
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Assuming that A #0. Baker showed (1977 i 
. that 
bound for |A] is of the form (1977) that a convenient lower 


Al > exp { - 27" *%n?2"d!5"*74(log Blos A1. (log A,)(log log A, A. 


(3.3) 


where yo, 71, Y2, Y3, Ya are positive real numb i 
> Pt» Pas Y3» ers, effect 
and not depending on n, a. b;. ively computable, 
For example, Baker showed: 


Yo = 0, Yı = 800, Y2 = 200, 73 = 200, Y4 = 0, (3.4) 
that is, 
|A| > exp{ —(16nd)°™(log B)(log A1... (log A,)(log log A. A1 (3.5) 


van der Poorten and Loxton gave in 1977 th i 
thea ere g e following better values for 


Yo= 47, =, scil 93=10, yy=10 (3.6) 


Some refinement in the form of the minorant and on the values of the 


constants may be expected. This will have an effect ‘ 
with the method. on the bounds obtained 


5 The main application of Baker’s method has been to provide effective 

` Se for solutions of certain types of diophantine equations. For Mordell’s 
uation = X“ +k (k #0), Baker proved in 1968: I j 

amd yon x? Lk thes p : If x, y are integers 


max{|x|,|y]} < exp{(10 A)", (3.7) 


B He also considered the hyperelliptic and superelliptic equations in 1969. 
et f(X) = aoX" + a X" +--+ a, with ao #0, a,€ Z, nz 3 and let 


A = max{]aoh jail, . . slan}. 
(3E) If f(X) has at least three simple roots and if x,y € Z satisfy 
J=», 


then 
max {|x|,|y|} < expexpexp(n! er" 4"). (3.8) 
(3F) If f(X) has at least two distinct roots, if m > 3, and if x, y € Z satisfy 
f(x) = y", 


then 


max {|x|,|y|} < exp exp {(5m) nto Ar"). (3.9) 


236 XI Estimates 


The most striking application of Baker’s method was obtained by 
Tijdeman in 1976. It concerns Catalan’s equation 


An Y"=1 


for arbitrary integers m, n > 2. In 1844, Catalan conjectured that if x, y are 
natural numbers and x” — y" = 1, then x = 3, y= 2, m= 2, n = 3. 

This conjecture has not yet been proved in its full generality, despite the 
efforts of many distinguished mathematicians. In some sense, it is a problem 
reminiscent of Fermat’s last theorem. It has been shown to be true in many 
special cases. Euler proved it, assuming that m = 2, n = 3. In 1850, Lebesgue 
disposed of the case n = 2. Nagell (in 1921) disposed of the cases m = 3, 
n> 2 and m > 3, n = 3. S. Selberg (1932) disposed of the case m = 4, while 
much later in 1964, Chao Ko settled the case m = 2. 

In 1952, LeVeque proved that given the natural numbers a, b > 1, there 
exist at most one pair of natural numbers m, n such that a" — b" = 1. This 
may also be seen using the theorems of Zsigmondy or Birkhoff and Vandiver, 
already mentioned. LeVeque proved also (in 1956) that given m, n> 2 
there exist at most finitely many pairs of natural numbers (x,y) such that 
xm — yr =I. 

This result was generalized in 1964 by Inkeri and Hyyrö for the equation 
X™— Y" = c (c + 0, m, n > 2 given). But both results are a simple applica- 
tion of theorem (3D). 

In 1953, Cassels conjectured that there are at most finitely many qua- 
druples of natural numbers (x,y,m,n) such that x" — y" = 1. The preceding 
results don’t show quite as much. They illustrate the force of the theorems 
of Thue, Siegel, and Roth—which therefore ought to be useful for Fermat’s 
equation. But, the main weakness is also apparent. Even though it might be 
proved that there are at most finitely many solutions, an upper bound for 
the solutions is not given. 

And now Baker’s method comes to the rescue. First, he proved, general- 
izing (3.7), that if m, n > 2 are given, if x, y are natural numbers such that 
x” — y" = 1, then 


max{x,y} < expexp{(5n)!°m'"}. (3.10) 


Tijdeman somewhat improved Baker’s lower bound for a linear form in 
logarithms and proved: There exists a number C > 3, which is effectively 
computable, such that if x, y, m, n are natural numbers, m,n > 2 and x" — y" = 
1, then max{x,y,m,n} < C. 

Tijdeman did not pause to explicitly compute the value, being assured 
that it was possible to do so. In 1975, Langevin proved that 


C < expexp exp exp 730. 


I should add that this estimate may be—and it is being—substantially 
lowered, as van der Poorten communicated to me. 
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Tijdeman’s theorem has essentially solved Catalan’s conjecture—anywa 
he settled the one by Cassels. What remains, is the examination of a 
finitely many (albeit large) remaining cases; perhaps they may lead to othe 
solutions. But these should be treated as being exceptional, or as > 
mathematicians like to say, sporadic. aoe 

Like a wanderer, I have deviated from my main path, to contemplate 
another landscape. By watching the fate of Catalan’s problem it is Se 
able to think that a similar method will give new results for F ermat’s EE 


4. Applications of the New Methods 


Yes, there are several applications of these powerful methods. But, someho 
they fall short of the expectations aroused, no doubt because F oe 
equation has four variables. The fact that Catalan’s equation had two sets of 
two variables (some in the exponent) made it more vulnerable 

The first result I want to mention is due to Inkeri, in 1946. By considering 


solutions by integers not too far a 
d part, the problem was red 
two variables. Precisely: EES 


(4A) Let n > 3, let M be a positive integer. 


a. There exist only finitely many tri i 

y triples (x,y,z) of integers such that 
y<z,x"+y"=z"andy-x<M. Í EE 
b. Idem, with z — y < M. 


In both cases 


z < expexp{(5n) An!" Ar", (4.1) 
where 


PROOF. Consider the diophantine equation 
(T + a + T" = U", 


where a > 1 is any given integer. 


Let f(T) =(T + a)" + T”. It has distinct 
t . roots, so by (3D 
equation has only finitely many solutions. onen 
FE 2, ce ee there are only finitely many integers x, z such 
x" + (x + a)” = z". This proves the first asserti is 
ee ee rtion. The other is proved 
The bound:given follows from Baker’s estimate indicated in BF). Oo 


Let me note now that, for the first case, with 
tm > A urely ele 
Inkeri himself had obtained a far better GE Nana. ae = 
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0<x<y<z,p does not divide xyz, and y — x < M, then 
Mais" 


2-1 


z< 


If z — y < M, then 
AI2MP 
2-1 

This illustrates the fact that if it is possible to obtain a bound by elemen- 
tary methods, it turns out to be incomparably better than the bounds given 
by Baker. 

Inkeri also settled Abel’s conjecture for the first case. 

I recall, from my fourth Lecture, that ifn > 2,0 < x < y < zare relatively 
prime integers such that x" + y" = z", then y, z are not prime-powers. If x 
is a prime-power then n = p is an old prime and z = y + 1. 

Inkeri showed (1946): 





z< 


(4B) Under the above assumptions, with n = p not dividing xyz, then x is 
not a prime-power. 
Proor. Ifit is, then z — y = 1. But z — y =r? (from Abel’s relations) so r = 1. 
By (1.8) and (1.13) 


s<t<r+s=1+s 
and this is a contradiction. go 


For the first case, nothing more than elementary methods were used. 
However, for the second case, with the powerful methods, Inkeri only reached 


partial results: 

(4C) Let p > 3. Then there exist at most finitely many triples of relatively 
prime integers (x,y,z) such that 0 < x < y < z, x? + y? = 2? and x is a prime- 
power. For each such triple z = y + 1, piy + 1) and 


y <expexp{2?(p — PEN < exp exp(2p!°)?’. 
Proor. Since z — y = 1, taking M = 2, the first assertion follows from (4A), 
part (b). By (4B), p divides xyz, but it is easily seen from x? = (y + 1)? — y” 


that p y x. So p|y(y + 1). 
The bound results by applying Baker’s estimate given in (3F). oO 


In the above situation, in view of Wagstaff’s computations, p > 125000. 


By Inkeri’s lower bound for y, given in (1D), 


pr~! < y < expexp(2p"°)”” 


so the above interval is much too large to derive any practical consequences. 
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RE E result exhibits quite an important new feature, an effective 
a = E SECH ee however, a strong hypothesis, namely 
med that z — x is fairly large with respect to i 

D D D E X I 
this is not to be expected in view of (1.3). = g GER 
The following theorem was proved by Stewart in 1977, in a slightly more 


general form. A variant was discovered ind 
A v ependently by I i 
der Poorten. I indicate a special case: P AE ER 


(4D) Let M > 0 be a given real number, let p be an odd prime and let 0 < 


x < y < z be relatively prime inte 
condition d gers such that x? + y? = z? If the following 


y — x < M(z — H "(Ne (4.2) 


is satisfied, then there is a positive real 
» th number C, dependi i 
may be explicitly computed, such that Ge 


p<C. 


Ges ive ae fairly detailed proof, which is an amalgam of those of 
rt and of Inkeri and van der Poorten, so as to i 
EE S o illustrate the method. 


z—x=b?, z—y=p'a” when p|x, (4.31) 
z=x=p 1b, z-y=a? when p| y, (4.311) 
z—-x=bP, z-y=uaP when p{ xy, (4.311) 


SC a, b are rational numbers, here so arranged that 1 < a < b. This fol- 
ows at once from Abel’s relations, in Lecture IV. 


Note that since z>y>x,z—-x>2,h i 
D A = 2, hence z — x > 2’, or in the s 
SCH z — x =p 'b?, with p|b, hence also z — x > p~! > 2? oe 
et r=0, 1, 0 and g=p~— 1, 1, 0, respectively, in cases 
» 4, ‚ 1, 0, respectively, in cases (4.31), (4.311 
(4.3111). I claim that there exists a positive real number c such = = Á 


(y — Alb pb oe p 
Indeed, since 3 < 2709s»? < pileer)” if(y — x)p'b™? > 4, then 
bU? A <(y—x)pb”?, 


So let (y — x)p"b™? < 4. But 


Sirus 2) a\P 
(y — xp == = 1 - -1- (5) #0 








“eX z—x 


It is easily seen that 
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oul 2 
1— pif z 
dk 
Applying Baker’s lower bound (3.5) to the above linear form in logarithms 
it follows that 
(y — x)p"b-? > Zexp{—c(log B)(log A,)(log A,)(loglog 4,)}, 
where A, = p, A, = max{a,b} = b, and B = p max{q,p}. Thus 


(y—x)p’b°? = eer + plos( | 


b 








(y — x)p'b"? > exp{ —c(log p)(log b)(log log p)} > b708 P”, 
On the other hand, by hypothesis 
y= x < Mis = x)! "UND = M(p "bey! UND, 
so 
boo toe PY? < (y — ab E M(p'b P~, 
But r = 0, I so 
bY? — c(logp)> < Mp™? < M. 


Either ./p < c(log p)? or c(log p}? < /p, hence 2¥?~ ies P} bp leen < M, 
so 4/p — c(log p)? < log M/log2. In both cases there exists a real positive 
number C such that p < C. g 


A simpler and more suggestive statement is the following earlier version 
of Stewart’s result: 


(4E) Let ¢ > 0 be a real number, let p be an odd prime, let0<x<y<z 
be relatively prime integers such that x? + y? = z? and y — x <2""®P, Then 
there exists an explicitly computable constant C such that p < C. 


PROOF. It suffices to take M = 2”. Then 
ve 20-9 < M < M(z— x) "UNEL 


By the preceding result, there exists an explicitly computable real number C 
such that p < C. 


In fact, Stewart proves the results cited above for an exponent which is 
not necessarily prime; this constitutes a nontrivial generalization: 


(4F) Given M > 0, there exists a number C > 0 (which may be explicitly 
computed in terms of M) such that if n > 3 is an integer, if x, y, z are relatively 
prime integers such that x" + y" = z" with O < x < y < z, then: if 2<z- 
y<M or if y—x <M, it follows that x, y, z, n are less than C. 


Inkeri and van der Poorten proved the following rather technical result: 


(4G) Let p be an odd prime, let I,,..., lm (with m > 0) be distinct primes, 
L < p;letw,,..., Wa be natural numbers. If x, y, z are relatively prime integers 
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such that O < x < y < z, x? + y? = 2? and [ Jr. , I} divides y — x, then 


e 1 —{L(log p)3/(p— 1 
HS e > (z — al "lee p/p- D} 
i= t 


where L = BU + l, +: + ln), with 





z—x =D? z—-y=p’"!a? when p|x, 
DEET b, z-y=@ when p|y, 
z—x=b?, z—-y=a?P when p ¥ xy. 


With this /-adic generalization, one cannot only show that y-x< 
M(z — x)! "vn implies that p is bounded in terms of M, as above, but in 
fact that p is bounded in terms of the greatest prime factor of M: 


(4H) Given the prime numbers |,,..., |, and an integer Ma > 0, there is 
an effectively computable number C > 0 such that if x, y, z are relatively prime 
integers, 0 < x < y < z, xX? + y? = SÉ, and if 


y= x< Mo% [em (z — x) TOND) 
for any nonnegative integers w4, ..., Wm, then p < C. 


The common idea behind these attempts is to find an effectively com- 
putable number which bounds the exponent. If this is done without any 
assumption, it amounts, at least theoretically (as I already said), to the solu- 
tion of Fermat’s problem. Up to now the bounds on the exponent could 
only be obtained under various more or less technical hypothesis on the 
nature of the would-be solutions. It may be possible that this kind of infor- 
mation, coupled with results of some other type, will indeed be very important 
for the solution of the problem. 

Who knows? 
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LECTURE XII 


Fermat’s Congruence 


In this lecture, I will turn my attention to an analogue of Fermat’s theorem. 
Instead of the equation, it will be a question of a congruence. In addition 
to the intrinsic interest of this modified problem, I mentioned in my fourth 
lecture how Sophie Germain’s criterion for the first case involves Fermat’s 
congruence modulo some prime. Accordingly, I will begin by studying the 
Fermat equation over prime fields. 


1. Fermat’s Theorem over Prime Fields 


Let p, q be primes, and consider the congruence 
X? + Y’ + Z’ = 0 (modą) (1.1) 


or equivalently, Fermat’s equation over the field F, with q elements. 

If p = q, then since a? = a (mod p) for every integer a, not a multiple of 
p, (1.1) obviously has solutions (x, y,z) where p / xyz. This case is uninteresting. 
The same thing happens if p or q is equal to 2. 

So, I assume that p, g are distinct odd primes. A nontrivial solution of 
(1.1) is a triple of integers x, y, z, such that they satisfy the congruence (1.1), 
andl <x,y,z<q-1. 

Let N(p,g) be the number of nontrivial solutions. It is equal to the number 
of pairs (x, y) such that 1 < x, y < q — 1 and x? + y? + 1 = 0 (mod q). I will 
state theorems which guarantee that N(p,q) > 0. This is quite important in 
view of the following observation, already known to Libri (1832): 


(1A) Let p > 2 be a prime. If there exist infinitely many primes q such 
that (1.1) has only the trivial solution, i.e., N(p,q) = 0, then Fermat’s theorem 
is true for the exponent p. 


246 XII Fermat’s Congruence 


Proor. Assume that there exist nonzero integers x, y, z such that 
xP + yP + 2? = 0. 


Let gu... d be the prime divisors of xyz. Let q be any prime, q > 
max{q;,....gn}. Then, for each such prime q, (x,y,z) is a nontrivial solution 
of (1.1), that is N(p,q) > 0. Thus, if N(p,q) = 0 then q < max{q;, ... xl, This 
contradicts the hypothesis. o 


However, the assumption made in (1A) is actually false. For example, 
Libri proved that 


N(3,q) > q — 8 — y4q — 27 when g = 1 (mod 3). 


So, if g = 1 (mod 3), q > 19, then N(3,g) > 0. 
Pépin computed in 1880 the exact value of N(3,q) (see also Landau, 1913): 


N3,q)=q—-—8 +1, 


where 4q = I? + 27m? (l, m are integers) and / = 1 (mod3). For example, 
N(3,19) = 4, N(3,31) = 19, N(3,37) = 18, N(3,43) = 27, N(3,61) = 52. 

In 1887, Pellet showed that for every prime exponent p there exists a 
number go(p) such that if q is a prime, q > go(p), then N(p,q) # 0. However, 
his proof did not provide any indication of the value of q,(p), nor about the 
number N(p,g). In a later paper of 1911, Pellet gave bounds for the number 
N(p,q). 

In 1909, Cornacchia gave an upper bound for qo(p). In the same year, 
Dickson indicated a more accurate upper bound, namely 


qo(P) < (p — 1)°(p — 2)? + 6p — 2. (1.2) 


Dickson also computed a lower bound for the number N(p,q). 
Dickson’s proof involves rather lengthy computations in the cyclotomic 
field. A shorter proof, yielding the less accurate upper bound 


del pl < (p!)e + 1 (1.3) 


is due to Schur (1917). It is reproduced in LeVeque’s book (1956). 

Concerning the number of solutions, already in 1837 and 1838 Lebesgue 
used the methods of Gauss and Libri to determine a general formula for the 
number of pairwise incongruent solutions of a congruence modulo a prime. 
He applied this formula to the congruences 


a,x] + aX? +e + aX = b (mod al, (1.4) 


where q = 1 (mod m), m > 2. 

Also in 1909, Hurwitz studied the congruence (1.4) for a prime exponent 
m = p, and b = 0, and thereby extended the results of Dickson. 

In connection with Waring’s problem, Hardy and Littlewood studied in 
1922 the special case where a, =-::=a,=1. This congruence is the 
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starting point in the theory which led to the Riemann hypothesis for func- 
tion fields over finite fields, the beautiful work of Hasse and Weil. A very 
illuminating paper on this topic is that of Weil (1949). It will be clear, as I 
proceed, that many of the methods used in evaluating the number of solu- 
tions of congruences have evolved from original ideas of Gauss. 

So much for the history of this question. Pll now explain Dickson’s 
theorem, and give an outline of the proof. To begin with, I will isolate the 
trivial special case. 


(1B) Let p > 2 be a prime. Let q be a prime such that q = 6mp + 1 (m 
integer) or p does not divide q — 1. Then 


X? + Y’ + Z’ =0(modq) 


has a nontrivial solution. 


Proor. If pX q — 1, if a, b are integers such that ap + b(q — 1) = 1, choosing 
integers x, y, z, not multiples of q, such that x + y + z = 0 (mod p), then 
x? = x (mod q), y” = y (mod q), z*? = z (mod q). Hence (x*,y*,z’) is a non- 
trivial solution of the congruence. 

Now, if q = 6mp + 1, if h is a primitive root modulo q, then (inn? = 1 
(mod q). Let h = h(mod q). So h?”? is a primitive cubic root of 1 in the field 
F,. Thus 


1+ heme 4 home — 0 (in EA 


Hence 
1 + herr + ho? = 0 (mod al o 


From now on, I may assume that q = kp + 1, where k is even and more- 
over 3% k. 

The proof of Dickson’s theorem involves the Jacobi cyclotomic sums, 
already used by Kummer (see Lecture VID. I recall the main definitions 
and properties. Let 


g = a primitive root modulo p, 
h =a primitive root modulo q, 
¢ =a primitive pth root of 1, 
p = a primitive qth root of 1. 


The p periods of k terms in the cyclotomic field Q(p) are: 


(k~ 1)p 


ho =p +p” +p +--+ peer”, 


h hP+1 h?P+1 h(k-1)p+1 
= ... , 
m= prt pe Terre (1.5) 


1 


hp- =P Të +p 


kän 3 hkp-1 


++p 
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Consider also the Jacobi sums, for every j = 0,1,...,p — 1. 


q-1 


t= py = A. inp’, (1.6) 


t=1 


where ind,(t) = s, 0 < s < q — 2, when t = h (mod q). In particular 


q-i 
to = <p) = E p= nh (1.7) 

t=1 

I recall that 
Tjtp—j 7 q (1.8) 
Titi = 4 (1.9) 
(€, denotes the complex conjugate of t,). Hence 

Tp—j = Tj (1.10) 


The periods may also be easily expressed in terms of the Jacobi sums: 


p-l 
mat „Ca, (fori=0,1,...,p — 1). (1.11) 
P j=0 
With all these concepts, I now indicate an expression for N{p,g) in terms 
of the periods: 


Lemma 1.1. 


1 pol 
NE = 7 D — 1) +(q— lp’ D n| 
Proor. If x, y, z are integers, 1 < x, y, Z < q — 1, then 


al N when x? + y? + z? £0(modg), 


t(xP + yP +zP) = 
2% P q when x? + y? + z? = 0 (modą). 


So 


azl Vë P+yP+zP) 
Nip.) = A. (z eg 
x. t= 


ai a P Sp 
txP ty Z 
ppp 
= x,y,zZ=1 


d i 
E - 
mi IM 
ma 
iM 
>, 
H 
nS_ 4 
Lu 
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Now if t = hi and x = h’, where 0 < i, j < q — 2, then 
q-1 q-2 Bu 
3. pi? = H oh Pi pn;. 
x=1 j=0 
Since i = i’ (mod p) implies that y; = 7; it follows that 
p-1 
qN(p.q) =(¢q-1P +k X pn, 
i=0 
where q = kp + 1. This concludes the proof. O 


Based on the above standard lemma, Klösgen in 1970 simplified the 
proof of Dickson’s theorem. I'll sketch this, omitting computational details. 


(1C) The number N(p,q) of pairwise incongruent nontrivial solutions of (1.1) 
satisfies: 


(q~ DIa +1- 3p — (p — 1p - 2) /4] 
< N(p.q) < (4 — DEg + 1- 3p + (p - 1p — YA]. 
Hence, if q 2 (p — 1)’(p — 2)? + 6p — 2, then N(p,q) > 0. 
Proor. By (1.11) and the lemma: 
Npa) =t [u — mm Am Us Lie fr { | 
q P` i=0 (j=0 


This may be brought to the form: 


Ida “j3 jijzjs 


TEE HEEN E Tj TTA | 


jis jz j3=0 
where 
-1 l Ea 
KËNT y gitti) — p when j, te ra # 0 (mod p), 
me p when j, + j2 +j, = 0 (mod p). 
Since tọ = — 1, 
q-1 
Nina) = "e la ~ 1}? — 1 — 3q(p — 1) + S], 
where 
p-1 
S= ; H Znfrtr, 
. , gis J2,f3=1 
Jit Jat j3=0(mod p) 
So 
7_ı “+1-3p|=-IS] 
q-1 4 p) zl | 
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But |r, = J4. Also, for every jı, 1 <j, < p — 1, there are p — 2 pairs (j2,j3) 
such that 1 < j2, ja < p — 1 and ji + ja + j3 = p or 2p. Hence 


|S] < (p - 1)(p - 2)q°””. 
Hence 


E Sped 


This leads at once to the first inequalities. The last assertion is now easily 
deduced. m 


An application of the general Hasse and Weil theorem (the Riemann 
hypothesis for projective curves over finite fields) yields the same bound 
for qo(p). Indeed, let N*(p,q) be the number of points of the curve X? + Y? + 
Z? = 0 in the projective three-dimensional space over the field with q ele- 
ments. Thus N(p,g) = N*(p,q) — 3p, since there are 3p points with one co- 
ordinate equal to zero. The curve is nonsingular, with genus g =(p—1)(p—2)/ 
2. The general formula is 


IN*(p,q) — (4 + D| < 294. 


Hence ifg+1>(p— 1)(p - 2)Jq + 3p, then N(p,q) > 0. A simple compu- 
tation leads to the same upper bound for qo(p). 

It should be pointed out that the bound for qo(p) is not the best possible. 
For example, by actual computation: 


Go(3) < 20, while N(3,q) = 0 only for q = 7, 13; 


qo(5) < 172, while N(5,q) = 0 only for q = 11, 41, 71, 101; 
go(7) < 940, while N(7,q) = 0 only for q = 29, 71, 113, 491. 


The generalization of Dickson’s theorem due to Hurwitz is the following: 


(1D) Let p, q be distinct odd primes, q = kp + 1. Let a, b, c be integers 
and let N denote the number of solutions (x,y,z), 1 < x, y, z <q — 1, of the 
congruence 


aX? + bY? + cZ’ = 0 (mod al (1.12) 
Then 
(q - Dia + 1) — (p — Us - Da - p] 
<N <(q- Dia +1) +p- Dp- 2V4 - m) 
where h is a primitive root modulo q, 
a = k (modq), b = h (modq), c = k (mod q), 
0<r,s,t <q -— 2, and 
0 whenr, s, t are pairwise incongruent modulo p, 
v=43 whenr, s,tare pairwise congruent modulo p, 
[ otherwise. 
Ifq 2 (p - 1)(p — 2)? + 2(pv — 1), then N > 0. 
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2. The Local Fermat’s Theorem 


One of the methods to study diophantine equations consists of searching 
for solutions in q-adic fields (for every prime q). In some cases, if the equation 
has a solution in every q-adic field Ô, then it also has a solution in Q. For 
example, this happens for quadratic equations. In such instances, it is said 
that the Hasse principle or the local-global principle holds. 

But, for Fermat’s equation, I’ll show that the local global principle is not 
satisfied. The basic result to be used is the lemma of Hensel (1908), which I 
will state in its original stronger form. 

Let Q, be the field of g-adic numbers, 2, the ring of g-adic integers, and 
v, the q-adic valuation of Q,. 


Lemma 2.1 (Hensel). Let F(X) be a monic polynomial with coefficients in 
2, Let Go(X), Ho(X) be monic polynomials with coefficients in Z, and let 
R = Res(G)(X),Ho(X)) € Z 


be the resultant of these polynomials. 
If r = vR) > 0 and if F(X) = Go(X)Ho(X) (mod 4°), where s > 2r, then 
there exist polynomials G(X), H(X) e Z,[X] such that 


G(X) = Go(X) (mod a 
H(X) = H(X) (mod q*~”), 
and F(X) = G(X)H(X). 


For example, if the residue classes of Go(X), Ho(X) modulo q are relatively 
prime polynomials in F [X], then R = 1, r = 0 and s may be taken to be 
equal to 1. 

The most useful special case is the following: 


Lemma 2.2 (Hensel). If F(X) is a monic polynomial with coefficients in Z, 
and if ae Z is a simple root of the congruence 


F(X) = 0 (mod q), 
then there exists a q-adic integer o e 2, such that o = a (mod q) and F(a) = 0. 
I owe the proof of the following result to Brettler (1974): 


(2A) For every prime q and every prime p, Fermat’s equation XP + Y? + 
ZP = 0 has a nontrivial solution in the field of q-adic numbers. 


ProoF. If p = 2 this is trivial, since there are already nontrivial solutions in 
Z. Henceforth I assume p # 2. 


First Case. q + P. 
Let F(X) = X? + q? — 1. Then 


XP + gq? —1 = (X — DUT Xr? 4-4 X + 1) (modg). 
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Since 1 modg is not a root of XP"! + XP”? +--+ X +1 modulo q, 
by Lemma 2.2 there exists a g-adic integer « such that « = 1 (mod.g) and 
a? +g? +(—-1)?=0. 


Second Case. q = p. 


Let 
F(X) =X? +p? +1, 
G(X) = X + 1, 
H(X) = XPT! — XP? ee oh 


The resultant R = Res(Go(X),Ho(X)) = Ho(— 1) = p, so its p-adic value 
is v (R) = 1. Since Go(X)Ho(X) = X? + 1, 


F(X) = Go(X)H (X) (mod pP). 
Noting that p > 3 > 2v, (R), by Lemma 2.1 there exist monic polynomials 
G(X), H(X) e 2,[X] such that 
G(X) = G(X) (mod p?~*), 
H(X) = H(X) (mod pP”), 
and F(X) = G(X)H(X). 
Then G(X) = X + a, where «e Kë «x = 1 (mod p?~!) and —a? + p? +1 
=0. O 


Another problem would be to determine when Fermat’s equation has a 
solution in units of the field of g-adic numbers. It is immediate from the 
definitions that the following holds: 


(2B) If p, q are odd primes (not necessarily distinct), the following condi- 
tions are equivalent: 


a. There exist units a, P, y € Z, such that ap + B? + y? D 

b. There exist integers Xo, Yo, Zo, not multiples of q, such that 
xP + y8 + z8 =0(modq'*?). 

c. For every n > 0 there exist integers Xn, Yn, Zu, not multiples of q, such that 


xP + y? + zf = 0 (mod q"*!*°) 


and Xnt1 = Xn (mod ar" Yn+1 = Yn (modq"* t), Zn+1 = Zn (mod th), 
where 

_ $0 ifg#p 

“1 ifgq=p 


This leads to the study of Fermat’s congruence modulo the powers of 
a prime. 
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3. The Problem Modulo a Prime-Power 


I will consider in this section the congruence 
XP" + yr” + ZP" = 0 (mod p”), (3.1) 


where p is an odd prime and n >m > 1. When does it have a solution in 
integers not divisible by p? 

It is possible to assume that n = m + 1. Indeed, a simple argument shows 
that if x, y, z are integers, not multiples of p, and such that 


xP” + yP™ + z2P" = 0 (mod p™* 1), 
then for every r > 0 there exist integers x,, y,, Z,, not multiples of p, such that 


x?” + yP” + 2?” = 0 (mod p™*!*") 


and x,+1 = x, (mod p"**), y,+1 = Y, (mod p’*"), 2,41 = Z, (mod p'*'). 


Thus, from now on, I shall take n = m + 1 in the congruence (3.1). With 
the same methods it is also possible, and in fact quite interesting, to study 
the congruence 


XP" 4 KÉ + XP" = 0 (mod p™*!), (3.2) 


As usual, a (nontrivial) solution (x,,...,x,) consists of integers satisfying 
the congruence and such that 1 < x; < p™*! — 1, p} x, (for all i = 1,...,k). 

Two solutions (x,,...,X;), (Y1 - - - ul are equivalent if there exists some 
integer a, not a multiple of p, 1 < a < p™*' — 1, and a permutation x of 
{1,2,...,k} such that y; = ax, (mod p"*") for i=1,...,k. 

I denote by ā the residue class of a mod p”*' and by (im? 1)* the multi- 
plicative group of invertible residue classes modulo p”*'. 

Let U be the-subgroup of all p”th powers Op" and let V be the subgroup 
of all bauch that b = 1 (mod p). It is an elementary fact that (Z/p™* +)” 
U x V. Leth > land 


h 
n=} aleet bei Ab 
i=1 
Then (3.2) has a (nontrivial) solution exactly when De kU. 

Let g be a primitive root modulo p, 1 < g < p and let r = g?” (mod p 
1<r<p"*!. Then U = {r7?7°,... 7?" + =T}. In other words, given g, 
every element X?” e U is uniquely equal to some power? (with 0 < i < p — 2). 
So every solution of (3.2) corresponds bijectively to a representation of 0 
as sums of powers of F in (Z/p"*')*: 


rit +r + +++ + r* = 0 (mod p™*!) 
with 0 < i <p — 2 (for t = 1,2,...,k). 


Two representations (r'4,r’?,...,r'*) and (r,r#,.... r°) are equivalent 
when they correspond to equivalent solutions of (3.2). In other words, there 


MEEL) 
3 
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is a permutation z of (1,2,...,k} and an integer h, 0 < h < p — 2, such that 


i, = jrm + h (mod p — 1) 
fort=1,...,k. 
A representation (r"', . . . ,r'*) of 0 is normalized when 


It is clear that every representation is equivalent to one which is normalized. 
However, it may be shown that an equivalence class of representations may 
contain more than one which is normalized. 

A cyclic solution of (3.2) is a solution (x41,X2, ...,x,) such that there exists 
an integer a, p Fa for which 


x, = 1 (mod p™*?), 


x, = a (mod p™*?), 


x; = a”! (mod p™*’), 


x, = ar"! (mod p™*!). 


The corresponding representation of 0 is 
L+ritr7it--+ + r& = 0 (mod p+’), (3.3) 


where a = r (mod p"*!,0<isp-2. 

It is easily seen that if p = 1 (mod k), then there is a cyclic representation, 
namely taking i = (p — 1)/k. In particular, if k = 3, m = 1, taking i = (p — 1)/3, 
then 

1+ ri +r” = 0 (mod p°). (3.4) 


In this case r! is a cubic root of 1 (mod p°). 
Klösgen showed (in 1970) the following criterion for Fermat’s theorem, 
based on the existence of a noncyclic representation of 0: 


(3A) Let m > 1, let p be a prime, p = 1 (mod 3). If there exist integers 

x, y, z, not multiples of p, such that 

xy Kat z =O, 
then 0 has a noncyclic representation modulo pn"). 
Proor. To begin, Klésgen uses a generalization by Inkeri (1946) of Fleck’s 
congruence [see Lecture IV, (3A)]. Namely, x? = x (mod p*”*'). Raising 
to the pth power repeatedly, this gives x?” = x?” (mod pi" TL, Similarly, 
ye” = yP” (mod p*"*1) and z?”” = 2?” (mod p*"*?*). 

By hypothesis, x?”” + y?’" + zP’” = 0 (mod p*"*'). Let w be an integer 
such that wx = y (mod p). From x+y+z=0 (mod p) it follows that 
z=x(p— | — w) (mod p), so x?" (1 +w” +(p— 1 — w)?” =0 (mod p™ +1), 

It suffices to show that the solution (1, w, p — 1 — w)is not a cyclic solution. 
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Since Lë" "up +'(y?""')? + (ep "IS = 0, it follows from Pollaczek’s result 
(1917) that sin" + (xy)?""' + y??""* Æ 0 (mod p). Hence x? + xy +y? #0 
(mod p). This implies that the solution is not cyclic, because if 


(p — 1 — w)” = w??"™ (mod p3"*1), 


then (1 + w)?”” + w??”” = 0 (mod p) hence 1 + w + w? = 0 (mod p), which 
is a contradiction. o 


For example, let p = 1 (mod 3). If the only representation of 0 modulo 
p* is the cyclic representation, then the first case of Fermat’s theorem holds 
for the exponent p. 

Klösgen described all the normalized representations equivalent to a 
given one. In 1965, Peschl also showed how to obtain new normalized 
solutions from a given one (under certain hypotheses), but I'll refrain from 
entering into more details. 

Instead, I now turn to the study of the number of solutions of (3.2). The 
method is similar to the one already indicated for the congruence modulo p, 
which gave Dickson’s theorem. 

Let p > 2 be a prime number, let k > 3 and m > 0. I denote by F(p,m,k) 
the number of (x4, ...,x,) such that 1 < x; < p —1(fori=1,...,k)and 


xP" + xB" +--+ + xf" = 0 (mod p+ +). 


Similarly, if a is an integer, 1 < a < p — 1, let F(p,m,k;a) be the number 
of k-tuples as above, such that , 


x?” + x5” ++ xP" = ap" (mod p™*?). 
Let N(p,m,k) be the number of (a... Au such that 1 < x; < p — 1 (for 
i=2,...,k) and 
1+ x8" +--+ + xP" = 0 (mod p™*?). 


For k=3 and m=1, I shall simply write: F(p) = F(p,1,3), F(p;a) = 
F(p,1,3;a), N(p) = N(p,1,3). 
For m = 0, it is not difficult to see: 


` bi ` (ak 
panne — NO u IT (3.5) 





(p — 1 +(- I 
> 





N(p,0,k) = (3.6) 


In particular, 
F(p,0,3) = (p — 1)(p — 2), 
N(p,0,3) = p — 2. 

More generally: 


(3B) With above notations: 
1. F(p,m,k) = (p — 1)N(p,m,k). 
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2. F(p,m,k;1) = F(p,m,k;2) = = F(p,m,k;p — 1). (This number shall be 
denoted by F*(p,m,k).) 
3. N(p,m,k) = N{p,m — 1,k) — F*(p,m,k). 


In particular: 


Ce) E 





N(p,1,k) = (p,1,k). 
p 
Thus, the determination of the number of solutions of (3.2) depends on 
the determination of F*(p j,k), for j = 1, 2, ..., m. To this extent, the above 


relations play only an auxiliary role. 

To arrive at an explicit formula for N(p,m,k), Klösgen introduced the 
periods of the cyclotomic field Q(¢), where ¢ is a primitive p”th root of 1 
(p an odd prime, m > 1). 

Let 


h = primitive root modulo p". 
p = primitive p”*'th root of 1, such that p? = ¢. 


The Gaussian periods of p terms are ņ; = n,(p,m,h) defined as follows: 


Ni = p" + fe + peer = Bae + DAE (for i = 0,1, o. ‚p" er 1). 
(3.7) 
Clearly 
pr-1i 
E n=0. (3.8) 
i=0 


Extend the definition of n, for every index j, by letting 4; = ;, where 
j =i(mod p”),0<i< p”-— 1. 

Up to a change of numbering, the Gaussian periods are independent 
of the choice of the primitive root h. 

Each y; turns out to be a real number and 


In| <p—1. (3.9) 


If p ft let ind,(t) = s, where 0 < s < (p — 1)p™ — 1, and t = h’ (mod p™*’). 
The Jacobi sums are defined as in (1.6): 


t= (Cp) = ` H Cindi (for j = 0,1,...,p"— 1). 
Gi 
In particular 
T = (Lois 
More generally, 
t;#0 ifandonlyif p/j. (3.10) 


Also 
Tj == Tym — j (3.11) 
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(t; denotes the complex conjugate of t,) and 


m+ 1 2 
f when p Yj, 


0 when pl paa 


As a generalization of (1.11), the following connection between the periods 
and the Jacobi sums may be shown. 


Consider the matrix Z = (kan pm—, and the vectors 


Ser 


To No 
Ty 

t=] and „= i 
Tpm-1 Npm-1 


Then ZZ = p"! (I the identity matrix), that is, 


2 GGH = pôw (fori, j=0,1,...,p" — 1). (3.13) 
Also, Zt = p"n, that is, 
= Cie ppm (for i= 0,1,...,p” — 1) (3.14) 
and t = Zn, ie 
on Cin, =, (fori=0,1,...,p"— 1). (3.15) 
As a corollary, ine following special sums of periods vanish: 
we 
> Ni pm = 0 (3.16) 


foril<k<mandi=0,1,...,p"*-1. 
Taking for example, m = 2, k = 1, then 


p-i 
È Nis jp = 0. (3.17) 
j=0 
It is also possible to evaluate certain sums of squares of periods: 
pk~1 
È nhim = p1) (3.18) 
j= 
fori <k<mandi=0,1,...,p™ *—1. 
In particular, taking k = m, then 
peat 
Ze (3.19) 
i= 


For m = 1, the Jacobi sums for i= 0, 1,..., p — 1 may be evaluated as 


follows: . 
q= pp”, (3.20) 


where a(p — 1) = ind,(1 + p). 
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In analogy with Lemma 1.1, Klösgen showed the following inductive 
expression for F(p,m,k) and N(p,m,k): 


(3C) With above notations: 
1. F(p,m,k) = (1/p)F(p,m — 1,k) + ((p — D/p"**) YP ot ni. 
2. N(p,m,k) = (1/p)N(p,m — 1,k) + (A/p™**) Po! nf. 


Taking m = 1, k = 3, this gives: 





(an Ss SE H 
nai = B 242 In 321) 


N) = + pap (3.22) 


Let S(p,k) = 3 ei fain, It, Then, the above recurrence relations 
become: 


F(p,m,k) = ae- D+ aen) (3.23) 


j=1 
N(p,m,k) = = lt — 1} + +(-1%+ > son) (3.24) 


The next task is to find an upper bound for the number of solutions 
N(p,1,3) = N(p). In view of (3.22), what is required is to find an upper bound 
for the sum 3 EJ n;. Taking into account the relations (3.8) and (3.19), the 
question is answered by the following: 


Lemma 3.1. Let n > 3, let 
v1 --- Va) = 2; ye 


defined for all points (y1,...,Y,) such that Y?_, y; = Oand Al y? = n(n — 1). 


1. If the function f assumes an extreme value at the point (y1,...,y,), then 
there exists an integer T, 1 < T < n — 1, such that (up to a permutation of 
indices) 


y= = yp =(n—-T) Tin- TY 





EE T i 
YT+1 Yn T(n — T) 


If y7 is the point with above coordinates, then: 


2. oT) = n(n — Dim — 2T)/(n — 1)/T(n — T). 


3. If T =1, then f(y") = n(n — 1)(n — 2) is the absolute maximum of the 
function. 
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4. If1 < T <n — 1, then f(y") is the absolute maximum of the function on the 
points (Y4, . . - ,Yn) of the given domain, such that 


~1 
Ven): i 
With this lemma, it may be shown that 
(3D) N(p) < max {y:|0 < i< p — 1}. 


Another question which may be answered is the asymptotic behavior of 
the number of solutions, as k tends to oo: 





—1 k-1 
Indeed, from the previous formulas: 
N(p,m,k) (—1)* T Spik) 
an ari = I +H -1 
GE pi tpt E g- 


Z Els a rf Tode 
Spt > [Mee Si 

Since |n;(p,,j)| < p — 1 (by (3.9)) it follows that Dm, „[n:(p,)/(p — DJ’ = 0 
proving the assertion. 

Also, since 0 < N(p,m,k) < N(p,m — 1,k) [by (3B)], there exists an integer 
Mo = mo(k,p) > 1 such that N(p,m,k) = N(p,mo,k) for every m > mo. This 
number may be interpreted as being equal to the number of solutions of the 
equation 

1+X,+:'+X,=0 


by elements in U, (the multiplicative group of(p — 1)th roots of 1 in the ring 
Z, of p-adic integers). 

Klösgen computed the values of N(p,m,k) for various values of m, k. To 
illustrate these results, I give some examples below: 


Table for N(p,1,k) 





5 0 9 0 100 35 
7 2 15 60 340 1680 
11 0 31 24 1600 5250 
13 2 33 200 2260 21630 
17 0 57 140 6220 50120 
19 2 51 390 6880 101430 
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Table for N(p,2,k) (with values underlined, 
when not the same as in the preceding table) 





k 

INE 4 5 6 7 

5 0 9 0 10 0 

7 2 15 60 340 ` 1680 
11 0 27 24 1090 2520 
13 2 33 180 1930 15540 
17 0 45 0 3160 945 
19 2 a 300 4600 44520 





The study of such congruences was also the object of numerous papers 
by Vandiver and by Hua, but their work is beyond the scope of my lectures. 
For the same reason, I shall also make only a passing reference to the work 
of Weil (1949), Igusa (1975) and others, on the number of solutions of diagonal 
forms f(X) = X} ++ X over finite fields F,. and rings Z/Zp’, where p 
is a prime not dividing d, s > 1. Let N, (respectively N¥) denote the number 
of solutions of f(X) = 0 in F, (respectively Z/Zp’). Weil and Igusa proved 
respectively that F(t) = 1 + Ei Nt, F*(t) = 1 +$, NI are rational 
functions ofthe indeterminate t. Stevenson (1978) gave a new proof of Igusa’s 
result, using the same method of Weil. 
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LECTURE XIII 


Variations and Fugue on a Theme 


As composers sometimes do, when they strike a rich theme, mathematicians 
also like to consider variations of an interesting problem. 

The theme is Fermat’s problem, in all its force and splendor. The varia- 
tions are analogous problems, sometimes reaching new depths, often only 
a caricature of the original problem. And the fugue, which usually ends the 
whole composition, keeps some of the original atmosphere. It begins with 
the well-known theme which, one might believe, has been completely ex- 
ploited. Yet, with a happy use of technique, it gains in substance and soon 
transcends the original idea. 


1. Variation I (In the Tone of 
Polynomial Functions) 


Consider the problem of finding the solutions of the equation 
X" + Y"=Z" (1.1) 


(where n > 2) in the field of rational functions K(t) (where K is a field and t 
is transcendental). 

Since the equation is homogeneous, it is equivalent to find solutions in 
polynomials that is, elements of K[t]. But this ring is a unique factorization 
domain, so ‘it suffices to look for the nontrivial solutions (f,g,h), where 
f, g, he K[t] and ged(f,g,h) = 1. 

In 1879, Liouville studied the problem when K = C, the field of complex 
numbers; he used analytical methods. In 1880, Korkine gave another proof, 
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which is algebraic and valid for any field K of characteristic 0. Another 
analytic proof with comments is in Shanks’ book of 1962 (page 145), as well 
as in a paper of Greenleaf (1969). This latter proof actually is valid over any 
field whose characteristic does not divide the exponent in Fermat’s equation. 

First, I consider the case n = 2. If K has characteristic 2, then any sum of 
squares is a square. So this case is trivial and may be left aside. 


(1A) Let K be a field of characteristic distinct from 2, and assume that 
there exist elements a, b, ce K such that a? + b? c?. Then the equation 
X? + Y? = Z? has primitive nonconstant solutions f, g, h in K[t]. If f has 
degree 1, and f? + g? = h?, then 


feat ta, 
a 





g=btars (FP) a%, (1.2) 
2a? 
h=c+dt+ Ge 


where de K,d = 0. 


The proof of this result contains no secrets. 
Now, I consider the case n > 2 and I present the proof of Greenleaf: 


(1B) Let n > 2 and let K be a field of characteristic not dividing n. Any 
primitive solution of (1.1) in K[t] consists of constant polynomials. 


Proor. Without loss of generality, I may assume K algebraically closed. 
Suppose that (1.1) admits a primitive solution (f,g,h), such that the 
maximum m of the degrees of f, g, h is positive. Among all such solutions, 
I consider one for which m is minimum. 
K contains a primitive nth root ¢ of 1, in view of the hypothesis. Let 


n-1 
g' =r- ft = T] 6- of) 
j=0 
Since the polynomials h — Or are pairwise relatively prime and K[t] is 
a unique factorization domain, each one is an nth power: 


h—Cf =g}, whereg;eKlt]. 


But h — f, h — Cf, h — £?f are in the vector space generated over K by 
h, f. So there exist ao, a), a, € K, not all equal to 0, such that 


aah — f) + ailh — Cf) + alh — EP) = 0. 


Actually, these coefficients have to be all distinct from zero, because f, h 
are relatively prime. 
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Put k; Alogge € K[t]. Then 
otk +k, =0 


with deg(k,) < m (j = 0,1,2), and max {deg((ko), deg(k,), deg(k,)} > 0. This 
contradicts the choice of m. o 


Pierre Samuel showed me this proof, which is more sophisticated. If 
f" +g" = h", where f, g, he K[t] are relatively prime nonconstant poly- 
nomials, then (f/h,g/h) is a generic point of Fermat’s curve X" + Y" = 1. 
Hence K(f/h, g/h) = Ku, By Lüroth’s theorem, Fermat’s curve is rational, 
that is of genus 0, and this implies that n = 1 or 2. 


2. Variation II (In the Tone of Entire Functions) 


This time, the search is for solutions in entire functions. But I'll not consider 
the question in its entirety (never mind the pun). I'll just pick a few striking 
results and proofs. 

Consider the equation 


X?+Y?=1. (2.1) 


An obvious nontrivial solution in entire functions is f(z) = cos z, h(z) = 
sin(z). Similarly, if h(z) is any entire function, then cos(h(z)), sin(h(z)) is a 
solution in entire functions. But conversely, Iyer proved in 1939: 


(2A) If f(2), g(z) are entire functions satisfying (2.1), then there exists an 
entire function h(z) such that f(z) = cos(h(z)), g(z) = sin(h(z)). 


Proor. From f(z)? + g(z)? = 1 it follows that [ f(z) + ig(2)](f(2) — ig(z)] = 1. 
Thus the two factors are entire functions without any zero. Then there exists 
h(z), an entire function such that 


SO + igi) =e" f(z) - ig(z) =e ™. 
Therefore f(z) = (e”® + e" ®)/2 = cos(h(z)) and g(z) = (e*? — e" *)/2i = 
sin(h(z)). o 
Now I consider the case where n > 3 and instead of (2.1), the equation is 
X” + Y" = p(z), (2.2) 


where p(z) is a nonzero polynomial function of degree at most n — 2. The 
following result was obtained by Gross in 1966. The special case where 
p(z) = 1 was first done by Iyer in 1939, and then by Jategaonkar in 1965: 


(2B) If n > 3, if p(z) is a nonzero polynomial of degree at most n — 2, if 
f(z) and g(z) are entire functions such that f(z)" + g(z)" = p(z), then f(z), 
g(z), and p(z) are constants. 
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ProoF. UU give the proof under the stronger hypothesis that p(z) has degree 
at most n — 3. For the case where p(z) has degree n — 2 the proof is somewhat 
more technical. 

Let £ be a primitive 2nth root of 1. Then 


Ka £ i (Bs ont) 


g(z) 


The meromorphic function p(z)/g(z)" has at most n — 3 zeroes. So, at 
least three factors of the right-hand side never vanish. Thus, the mero- 
morphic function f(z)/g(z) misses three values. By the well-known theorem 
of Picard, f(z)/g(z) is a constant. 

Now, it is anticlimatic to conclude the proof of the statement. Oo 





Obvious corollaries are the following: 


(2C) If n 2 3 and f(z), g(z) are entire functions such that 
f(z)" + ga)" = 1, 


then f(z), g(z) are constants. 


(2D) If n> 3, if f(z), g(z), h(z) are nonzero entire functions such that h(z) 
never vanishes, and if 


F(Z" + gi)" E 


then there exist nonzero complex numbers a, b such that f(z) = ah(z), g(z) = 
bh{z), a" + b" = 1. 


3. Variation III (In the Theta Tone) 


In the preceding variation, it was quite exciting seeing that the (little) Picard 
theorem provided the key for the proof. 

This time, the search is for solutions which are analytic in the interior 
of the unit disk. And the exponent is just equal to 4. 

In 1829, Jacobi discovered a solution for 


X* + Ki = Z*+ 


in theta functions. Since these functions play quite an important role in 
various branches of arithmetic, analysis, and geometry, P1 take the time 
to introduce them and indicate the main steps towards Jacobi’s result. 

It is convenient to introduce the theta functions as solutions of certain 
functional equations. 

Let t belong to the upper half-plane and let q = e*", so al < 1, that is, 
q is in the interior of the unit disk. 
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Let k > 1 bean integer, let t be given in the upper half-plane, let b = b(t) e C, 
b #0 and consider the vector space V (depending on k,t,b) of all entire 
functions f(z) satisfying 


fz+m=f(2 f(z + nt) = be "Mertz (3.1) 
(3A) V has dimension k. 


SKETCH OF THE PROOF. Let Noe e V. For every integer n ¥ 0, 


" fe —2niz dz 


e 
" —n/2 


is the coefficient of the outer expansion 3 7. ,, ce?” of the periodic func- 
tion f(z) in the interval (— 2/2, 2/2). 

For every integer h # 0, consider the rectangle C,, with vertices — 7/2, 
7/2, (n/2) + ih, — (7/2) + ih. By — theorem and the hypothesis (3.1) 
Ca = eh > > fi (z — ih)e?"" dz. 

Thus 
|en] < e?""nM,, 
where 
M,= max {|f(z-— ih}. 
—n/2sz<n/2 

Writing t =u + iv (where u,v are real, v > 0), writing h/x = mv +r, 
with 0 < r < v, and man integer, and writing mu = s Lu, with -4< u < 4, 
and s an integer, it is not difficult to show that 


M, a br ehn 4 


where 


A=max {e’"|f(z+ n(u' — ir))]}. 
-n/2<sz<a/2 


From these majorations, it is possible in a more or less standard way, 
to deduce that the Fourier series is absolutely and uniformly convergent 
in any bounded region. Thus 


f) = 5 er 


n=- o0 
The second functional equation of f(z) gives relations between the 
coefficients of the Fourier expansion, 


Catk = b- e 
Hence, if n = sk + r, with integers r, s,s > 0,0 <r < k, by iteration 
Cn = Cra sk = Le tae 
This relation holds also for s < 0. 
Let 
f(z) = e?riz > b’gzter + 480s 1k) 22skiz, 


s= — o 
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From the uniqueness of the Fourier expansion, it follows that f(z) is, 
in a unique way, a linear combination of the functions fo(z), fı(z) - - - » Ae—1(2), 
which belong to the vector space V, and therefore constitute a basis. oO 


The special case where k = 1 and b = q~' gives the function denoted by 


Ka 
O(zg= A ge 


= 1 + 2qcos2z + 2g*cos4z + 2q? cos6z +". (3.2) 
Taking k = 1 and b = —q7! gives the function 


O,(zq)= A L-Urgeit 


SO 


= 1 ~ 2gcos2z + 2g*cos4z — 2q?cos6z+°°° . (3.3) 
Similarly, considering the functional equations 
Se+m)=-fl2) f(z + at) = be f(z) (3.4) 


it is seen that the space of such functions has dimension 1, and a generator is, 
for example, the function 


f*(2) = 3 bo Sq est Diz 
Taking b = —q™', respectively b = q7 1. gives the functions 


Ka 


9 ,(2,9) = —igi!* >; (— Uer test 1)iz 


s= = 0 


= 2q"*(sin z — q? sin 3z + gf sin 5z — q'°sin7z +++) (35) 


9,(2,9) = ail? 3 që test iz 


= 2q'!*(cosz + q? cos 3z + qf cos 5z + q? cos7z +++). (3.6) 


The functions G/zal (i = 1,2,3,4) are Jacobi theta functions. I denote 
also 6,(q) = @;(0,q4), so these are functions defined in the interior of the 
unit disk. 

These functions are of course mutually related. For example, for every q 
in the interior of the unit disk: 


9, (: + 7 d = 0,(z,q), ©; Lk + >> a) = —0,(2,q), (3.7) 


T 
9; (7 + 5 a) = ,(z,9), 0, (: + J? a) = 0 (2,9), (3.8) 
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m E Tt ee 
O, (: tiz a) = iqg™*!*e~#@,(z,q), O, (: +t, a) = ig "e "70 ,(z,q) 


2 
(3.9) 
vin z 
el +15, a) = q7 eg (2,9), el + LG a) = q7 tte 702) 
(3.10) 
and also 
9,(0,9) = 0. (3.11) 


Between the squares of any three theta functions, there are also linear 
relations: 


(3B) The following four linear relations are satisfied: 


O5(0,q)O7(z,q) + O30,q)O3(z,q) = ©3(0,q)O3(z,q), (3.12) 
A3(0,q)O4(z,q) + O5(0,q)O3(z,q) = O3(0,q)O2(z,q), (3.13) 
A3(0,q)Oi(z.q) + O%(0,q)O3(z,q) = O3(0,q)OHz,q), (3.14) 
— 930,)0%2,9) + O3(0,q)O3(z,q) = 02(0,q)O2(z,q). (3.15) 


Proor, I'll prove only the first relation. 
The squares of the theta functions satisfy the functional equations 


fetr =f) fle+nt)= ge *ef(z). (3.16) 


By (3A) the space of entire functions satisfying these equations has dimen- 
sion 2. So, the squares of any three theta functions satisfy a nontrivial linear 
relation. 

If a, b, c are complex numbers such that 


aO 5(z,q) + bO3(z,q) = cO4(z,q) 
computing these functions at z = 7/2, t(n/2), (7/2) + t(n/2) and taking into 
account the relations indicated above yields 
a03(0,q) + bO7(0,q) = cO%0,9), 
—aq~"@4(0,q) + bq~*?@3(0,q) = cq"? 3(0,q), 
og *!°@3(0,q) — ba 040,9) = — cq” "7 O7(0,q). 


But ©,(0,9) = 0, @,(0,q) # 0, hence 








04(0,9) SEH 
“= 630.9)" ag WRC #O. 
This gives the first relation. g 
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It follows from relation (3.15) for z = 0 that 
03(0,q) + 940,9) = 03(0,) (3.17) 
and this may be rewritten as: 


(3C) The theta functions 0,(q), 64(q), 93(q) satisfy the relation 


63(q) + 04l) = 03(q) 
in the interior of the unit disk. 


The proof whose main points I sketched may be found in the book of 
Bellman (1961). Another completely elementary proof has been published 
by van der Pol in 1955. 


4. Variation IV (In the Tone of 
Differential Equations) 


In 1962, Rodrigues-Salinas looked at the following differential equation, in 
analogy to Fermat’s equation: 
u õu u 
BREI RE RR 41 
So äu" a GL) 

There is, surprisingly, a relationship between the solution of Fermat’s 

equation 

xX" + Y"=Z" (4.2) 
and the solution of the differential equation in functions u(x,y,z), satisfying 
certain periodicity and boundary conditions. 

Let Q be the unit cube in the space Hi. that is, the set of triples (x,y,z) 
such thatO<x<1,0<y<1,0<2z< 1. Suppose that u(x,y,z) is a real- 
valued function of three real variables, with continuous partial derivatives 
up to the order n. Assume also 


a. u has period 1 in each variable. 


b. There exists M > 0 such that, for any x, y, z: 
<M, a <M. 
oy" 


2) (3 














Ou 
ES M, isn 





c. u is a solution of (4.1). 


(4A) With the above hypotheses, the Fourier expansion of u(x,y,z) is 
convergent: 


oo 
u(x, y,Z) = 3 Aue +qy+rz) 


Par 


and if p" + q" # r", then A,,, = 9. 
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The next results distinguish the cases where n is odd or even. To state 
these we consider two further conditions on a solution u: 


d. u(x,y,z) vanishes on any of the faces of unit cube ©. 
d u(x,y,z) vanishes on two faces of Q. 


(4B) If n is odd and if u(x,y,z) satisfies the conditions (a), (b), (c), and (d), 
then u(x,y,z) is identically zero. 


(4C) If n = 2 is even, the following are equivalent: 


1. There exist nonzero integers a, b, c such that a" + b" = ch. 
2. There exists a function u(x,y,z), not identically zero, satisfying (a), (b), (c), 
and (d’). 


TU not try to sketch the proof of these results, since they may be found 
in the paper of Rodrigues-Salinas. Let me just say that if a? + b? = c?, then 
u(x, y,Z) = (sin 2nax)(sin 2rby)(sin 2rcz). 


5. Variation V (Giocoso) 


Great tragedies often have some scenes of humor, tense musical symphonies 
their moments of gaiety. Mathematicians of the most serious kind also like 
to be amused. 

While collecting information about Fermat’s theorem, I gathered various 
items which I have decided not to hide. 

To begin, I wish to quote a paper of Orts Aracil (1961). He said: 


As Fermat tried to find the solutions of X” + Y” = Z” in positive integers, 
he must have considered first the easier cases n = 3, n = 4, obtaining the result 
that no such solutions exist. In view of this, he decided to tackle directly the 
general equation and begun searching whether the equation admits solutions 
of the form 


where a, p, q, r are positive integers. 
He required the equality 


ert + (at = (a) 
that is 
a’? + a" = om 
which may be written also: 
(a) + (a") = (a’y, 


and putting a” = m, 
m + mt =m. 
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Previously, in the paper, Orts Arcil considered the dual of Fermat’s 
equation 


and he proved: 


(5A) If n > 3, there are no positive integers x, y, z such thatn + w = H. 
However, for n = 2, if 2% +2? = X, then x = y,z= x +1. 


He says: 


Coming to the theme proper of this article, which constitutes the key of 
our conjecture, let us observe that if in Fermat’s equation 


x" + y” = zZ" 
the bases and exponents are permuted, it becomes 
nm +n =n, 


an equation to which, icuriosa coincidencia!, exactly and literally the same 
statement may be applied: this equation has no positive integer solutions 


ifn > 3. 
And even more curious, is that the proof in this case is immediate, at the 
level of any student of the first courses of mathematics at the university. 


He adds that he proved this point experimentally. And he concludes: 


This result [(5A)], no doubt known to Fermat, might have induced him, 
“sin mäs” to formulate his famous theorem. 


“Caramba,” it might have been indeed a “curiosa coincidencia”. One 
which spurred a great progress to mathematics. 


6. Variation VI (In the Negative Tone) 


In 1967 and 1968, Thérond considered the equation X" + Y" = Z" where n 
is a strictly negative integer. And he proved: 


(6A) If Fermat’s theorem is true and if n < —2, then the equation xX" + 
Y" = Z" has no solution in nonzero integers. 


(6B) The equation X~!+Y~1=Z~' has primitive solutions (x,y,z) 
which are of the form 
x = a(a + b), y = b(a + b), z = ab, 


where a, b are nonzero integers, gcd (a,b) = 1. 





EE 
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(6C) The equation X~? + Y~? = Z~? has primitive solutions (x,y,z) which 
are of the form 


x= 2ab(a* + b°), y= (a° ~ b?)(a? + b*), z= 2ab(a? — b2), 


where a, b are nonzero distinct integers such that gcd(a,b) = 1. 


7. Variation VII (In the Ordinal Tone) 


In 1950, Sierpiński proved that the last theorem of Fermat is false for ordinal 
numbers: 

For every ordinal number p, there exist distinct ordinal numbers a, ß, y, 
each larger than u and such that 


a" +ß"=y" forn=1,2,3,.... 
This is the first of various similar results showing that Fermat’s equation 
x4 + y+ _ Zi 


where A is any ordinal number, also admits solutions which are not trivial. 
Once more, it is brought to light how strange is the arithmetic of ordinal 
numbers. 


8. Variation VIII (In a Nonassociative Tone) 


Nonassociative arithmetics? Yes, they have been invented, and perhaps soon 
they may enter into mathematics with some unexpected applications. While 
this day has not yet arrived, it is nevertheless fortunate that mathematicians 
know whether their Fermat’s last theorem is true or false in a nonassociative 
arithmetic. 

I suspect that not everyone is familiar with this new theory. So, I will 
expose its general ideas. 

I trace the first paper to Etherington in 1939 (and again in 1949). Under 
another name, they were considered also by A. Robinson (1949) and Evans 
(1957). It was Evans who proved Fermat’s last theorem in such nonassociative 
arithmetics; see also Minc (1959). 

It is best to follow the method of Peano, used to define the natural numbers. 
Let S be a set of elements which will be called nanumbers (nonassociative 
numbers), satisfying the following conditions: 


a. There is a nanumber, denoted by 1. 
b. There is a binary operation + between nanumbers. 
c. There are no nanumbers a, b such that a + b = 1. 
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d. Ifa, b,c,darenanumbers anda + b = c + d, thena = c,b = d. 
e. (Principle of nonassociative induction): If S’ is a subset of S, such that 
1 e S’ and S’ is closed under addition, then S’ = S. 


The nanumbers are therefore represented as 1, 1+1, 1+(1+ 1), 
(1 +1)+1, 1+0 +(1+1), 1+(1+1)+1}), €+1)+(1+1,(1+1)+ 
1)+1,(1 +(1 + 1)) +1, and so on. 

The natural nanumbers are 1, 1 + 1 denoted 2, and similarly 1 + (1 + 1) = 
1+2=3,1+3= 4, ete... 

Each nanumber has a length, defined as follows: |1| = 1; if a=b +c, 
then |a| = |b| + |c|; this is well defined in view of the axioms (c) and (d). 

I will now state various properties: 





1. Ifa + b= b +a, then a = b. 
2. Foralla, b: a #a +b. 
3. For all a, b, c: a + (b + c) Æ (a + b) + c. 


The multiplication of nanumbers is defined as follows: 


al=a and a(b + c)= ab + ac. 


Clearly |ab| = |a|.|b|. Among the properties, I note: 


4. If ab = 1, then a = 1 and b = 1. 
. la = a for all a. 

. a(bc) = (ab)c for all a, b, c. 

. If ba = ca, then b = c. 

. If ab = ac, then b = c. 


X NDA 


From the associative property of multiplication, it is possible to define 
unambiguously exponentiation, where n is a natural number: a! = a, 
a" = aa! = OT ta, 

If a = bc, then b is called a left factor of a and c is a right factor of a. If, 
moreover, b, c are different from 1, a, they are called proper factors. If a # 1 
has no proper left factor, then it is called a prime nanumber. It follows that 
it has no proper right factor. 


9. Ifc is a proper left factor ofa + b, then c is a left factor of a and of b. 
In particular, 1 + b, a + 1 are primes. 
10. If the prime c is a left factor of ab, with a # 1, then c is a left factor of a. 
And now, the fundamental theorem of the arithmetic of nanumbers. 


11. Every nanumber is, in unique way, a product of prime nanumbers. 
12. Ifa prime nanumber p is a right factor of ab, and b # 1, then p is a right 
factor of b. 


With all this preparation, I give the proof of Fermat’s last theorem for 
nanumbers: 
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(8A) If n is a natural number, n > 2, then the equation X" + Y" = Z" has 
no solution in nanumbers. 


Proor. Assume that x, y, z are nanumbers such that x” + y" = 2”. 
Note that x 4 1, otherwise z” = 1 + y" would be prime [by (9)], which is 
not true, because n > 2. Similarly, y # 1. I note also that |x|" + |y" = Ex, 
Since z is a proper left factor of z" = x" + y", by (9) z is a left factor of x” 
and of y". By the fundamental theorem (11), 





x= PPr... Ps (s21, 
z=4142:.:.4 (t21) 


Since zu = x” (for some nanumber u), q; °° qt = PıPa ''" DAN 

If t < s, by the uniqueness of decomposition into a product of prime 
nanumbers, q1 = Pis - - 4: = Po hence x = zp,41 °** Ps 30 |x| = |z| |ps+ ı| 
\p.| = |z|. On the other hand, |z|" = |x|" + |y > |x|" so |z| > |x|, which is a 
contradiction. 

Therefore t>s and q,=P»---,9=P, SO z=xa, where a= 
qs hL. 

A similar argument gives z = yb where b # 1. Therefore |z|" = |x|" + ||" 
= (|2|"/lal") + (|z]”/)b]”), hence 1 = (1/Ja|") + (1/|b|"), where el, |b] > 1. This is 
clearly impossible. m 


n—1 








9. Variation IX (In the Matrix Tone) 


Not much is known concerning the solution of Fermat’s equation in square 
matrices. The question is trivial if singular matrix solutions are allowed. 
Indeed, if k > 2, if 


ch rbl Chi 
ck ri Cla 


are k x k matrices, then they are idempotent, A + B = Č, so A" + Br = CH 
for every n > 1. This was explicitly noted by Bolker in 1968 and, in special 
cases, by Domiaty in 1966. 

Hence, I shall only be interested in solutions in nonsingular matrices. 
Bolker proved the first result of any interest: 


and if 


(9A) Let R be a commutative ring with unit element. Let m > 1 and assume 
that there exist nonzero elements x, y, ze R such that x" + y" = z". Then 
there exist nonsingular n x n matrices A, B, C over R such that 


Am + Bm = Cm", 
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PROOF. If z is any permutation on n letters let P = p(x) be the n x n matrix 
over R: 


P= (di 


that is, the entry of P position (i, j) is ö,«),; (the Kronecker ö-symbol): 
If z is any circular permutation, p(z) = P, let 


A = diag(x,1, ...,1)P, 
B = diag(y,1,...,1)P, 
C = diag(z,1,...,1)P. 


Then the matrices A, B, C are nonsingular and satisfy Fermat’s equation for 
the exponent nm. oO 


Taking m = 1 and nonzero elements x, y, z such that x + y = z, then the 
above construction gives n x n nonsingular matrices A, B, C such that 
A" + B" = CH, 

In 1972, Brenner and de Pillis determined other solutions in nonsingular 
n x n matrices for the equation in (9A). These authors have also proved: 


(9B) If A, B, C are nonsingular n x n matrices with entries in Z, if AB = BA 
and if A" + B" = C”, where m > 1, then there exists a nontrivial triple of 
algebraic integers a, b, c, each of degree at most n and such that a” + b" = e. 


This follows at once from the existence of a common eigenvector for the 
commuting matrices A, B. 
A partial converse, proved by Brenner and de Pillis is the following: 


(9C) If a, b, c are integers in a quadratic number field, such that a” + b™ = 
c™, where m > 2, then there exist nonsingular 2 x 2 matrices A, B, C, with 
entries in Z, such that Am + B™ = C". 


In 1961, Barnett and Weitkamp studied in more detail the solution of 
Fermat’s equation in 2 x 2 nonsingular matrices with rational coefficients. 
This is their main result. 


(9D) Let n > 2, n #4. If Fermat’s theorem holds for the exponent n (when 
n is odd) or for the exponent n/2 (when n is even) if A, B are nonsingular 2 x 2 
matrices over Q, which are not scalar matrices but such that A", B" are scalar 
matrices, then there exists no 2 x 2 nonsingular matrix C, with entries in Q, 
such that 
A" + B" = C". 


Barnett and Weitkamp gave also many examples of matrices A, B, C such 
that A” + B" = C": 
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1. Take 
-1 -1 
en) 
gi >) n>3,n=3(mod4). 
2. Take 
-3 -5 2 
al 2 d sl i): 
di 3) n> 1,n = 1 (mod 4). 


In the above examples the matrices have fourth power equal to the identity 
matrix. 


3. If r is any rational number, and if 


2 
dë 1 
A= 
3 9 3° 


then 4? + B? = CH. 


And there are more examples and methods, but TU not discuss this matter 
any further. 


10. Fugue (In the Quadratic Tone) 


After all the variations, the long awaited fugue comes and it is written in the 
quadratic tone, which most suits it. What I really mean, is that I’ll now 
consider the solution of Fermat’s equation in algebraic number fields. And 
here again, the trivial cases have to be excluded. For example, 1” + 1" + 
(z/—2)" = 0 tells us that in OG) 2) the equation 


X"+ ¥"4+ Z"=0 (10.1) 


has a nontrivial solution. So, the problem is interesting only when the 
algebraic number field has degree less than the exponent. 
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How about the simplest situation: To find out whether (10.1) has a solution 
in quadratic number fields? This study has been done only for a few ex- 
ponents: n = 2, 3, 4, 6, 9. 

If n = 2 it makes a difference to consider (10.1) with exponent 2, or the 
equation X? + Y? = Z?. This equation has nontrivial solutions in ordinary 
integers, but it may be asked whether there are solutions of the type 
(a+ b,/m, EES b,/m, c), where a, b, ce Z, m is a square-free integer, m # 0, 1. 
Aigner answered this question in 1934: 


(10A) There exist nonzero integers a, b, ce Z such that 


(a + b/m? + (a — bym? = c? 
if and only if m has no prime factor p such that p = +3 (mod 8). 


On the other hand, for the equation X? + Y? + Z? =Q, it is only in- 
teresting to investigate the solutions in an imaginary quadratic field. The 
following result was first proved by Nagell in 1972; a simpler proof was 
given by Szymiczek (1974): 


(10B) If m > 0 is square-free, then the equation X? + Y? + Z? = 0 has a 
nontrivial solution in the imaginary quadratic field Qt —m) if and only if 
m+ —1 (mod 8). 


Another way of looking at this result is the following. The level of an 
imaginary field is the smallest number s of squares such that — 1 is the sum 
of s squares. Since the field is imaginary, such number s exists, as was shown 
by Artin and Schreier (1927). Thus, the above theorem says that the level of 
Q(./ — m) is at most 2 if and only ifm Æ — 1 (mod 8). 

Aigner settled the case of exponent 4 in (1934); a new proof was given by 
Fogels in 1938: 


(10C) The equation X* + Y* = Z* has a nontrivial solution in the field 
Ota (where m is a square-free integer, m= 0,1) if and only ifm = —7. 


In this situation, every nontrivial solution is equivalent (that ts pro- 
portional with a nonzero multiplier) to one of the following solutions (with 
arbitrary signs): 


(40 +./—7, +01 — /—7), +2). 


The proof is somewhat elaborate, but involves no real difficulty. 

Since I’m not giving the proofs for the exponents 6, 9 (which depend on 
the case of the exponent 3), I may as well state the results right now. Once 
more, they were obtained by Aigner in 1957: 


(10D) If m # 0, 1 is any square-free integer, then X° + Y° = Z° has only 
the trivial solution in Q,/m). 
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(10E) If m # 0, 1 is any square-free integer, then X? + Y? = Z? has only 
the trivial solution in Oe, 


The proof of (10D) is also based on a result due to Nagell (1924): 
X?—1=3Y? (10.2) 
has only the trivial solution (1,0) over the field © of rational numbers. I stress 
that there are no other rational solutions. It is already not totally easy to 
show that there are no other integral solutions of (10.2) than (1,0). See a proof 
given by Oblath (1952). 

Concerning the rational solutions, the analogous equation X? + 1 = Y? 
had been treated already by Euler. Nagell has in fact considered the equation 
X?’ + 1 = DY?, where D # 0 is an arbitrary integer. Taking D = —3, the 
equation is equivalent to (10.2). 

These considerations illustrate the fact—to be expected—that to solve 
diophantine equations over quadratic fields, leads to the solutions over Q, 
or Z, of some associated diophantine equations. 

Let me aiso remark that (10D) and (10E) would be automatically true, 
if the Fermat cubic 

X?+Y3+Z3=0 (10.3) 
had only the trivial solution. But, as I shall indicate, this is far from being the 
case. The questions become: for which quadratic fields does (10.3) have 
nontrivial solutions, and in this situation, to describe all possible solutions. 

To fix the terminology, two solutions (x,y,z) and (x’,y’,z’) in a field K are 
K-equivalent if there exists a €e K, a # 0, such that x’ = ax, y = ay, Z’ = az. 
A solution (x,y,z) is called a quadratic solution if x, y, z belong to some qua- 
dratic extension of Q. 

In 1915, Burnside described all the nontrivial quadratic solutions of 
Fermat’s cubic; they may be parametrized by the rational numbers different 
from 0, — 1. This was rediscovered by Duarte in 1944, who gave a simpler 
proof. l 


(10F) 
1. If ke Q, k #0, —1, and 
x =3 + rer 
Yk = 3 — S= 3(1 + 4k), 
2, = 6k, 


then xp + yp + z = 0. 

2. Ifae Q, a= 0, if k’ = ak (with k + 0), then the solutions corresponding to 
k and K by the above method are equivalent if and only if a = 1. 

3. If (x,y,z) is a nontrivial solution in the quadratic field Q(./m), there exists 
ke Q, k #0, —1, such that 


m= —3(1 + 4k’)u?, (10.4) 
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where u is a rational number and (x,y,z) is Q(./m)-equivalent to the solution 
(Ku Dr Zu 


This parametrization falls short of the expectations, because k is a rational 
parameter. It gives of course many quadratic solutions, for example: 


k = 2 gives (3 + 34% — 11,3 — 3,/ — 11,12), 
k = 1 gives (3 + y — 15,3 — y — 15, 6), 
k = —2 gives (3 + /93,3 — ./93, — 12). 
On the other hand, it is not obvious at once that there is a solution in 
Q(./ —2), since it is not obvious that 
—2 = —3(1 + 4k’)u? 


has a rational solution k. But, in fact taking k = 4, u = 4, this gives the 


solution (2 + /=2,2 — /—2,2). 

In order to study the quadratic solutions of the Fermat cubic, it is helpful 
to remark that any solution is equivalent (in the field in question) to a con- 
jugate solution, which is one of the form 


x=a+ b/m, 
y =a — b/m, (10.5) 
Z=C. 


This was established by Fueter in 1930 and again by Aigner in 1952. 
And now comes a very crucial fact shown by Fueter in 1913: 


(10G) The Fermat cubic has a conjugate solution in Ota) if and only 
if it has a conjugate solution in Q(./ — 3m). 


ProorF. I give this proof, since I shall require the explicit formulas to pass 
from the solution in one field to the other. 
Let (10.5) be a conjugate solution in Om), Then 


—c? = 2a(a? + 3mb?). (10.6) 
Writing (a + b/m}? = «+ Ëm (with a, ße Z), then (a - b,/m)’=a- 


Dm so 


(a? — b’m)? = a? — Bim 


From x? + y? + z? = 0 it follows that 
a = ala? + 3b?m) = —$c°, 
B = ba? + bm), 


€ = 3a + Py Am 
n = 3a — BV — 3m (10.8) 


£ = 3c(a? — b?m) 


(10.7) 
Then 
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is a conjugate solution in Q(./— 3m). It may be verified that this solution 
is not trivial. 


To show the converse, I just note that O(,/m) = Q(,/—3(— 3m). m 


From a more sophisticated point of view, the above method of deducing 
solutions in Q(,/ — 3m) from solutions in Oto corresponds to the com- 
plex multiplication by „/—3 on the elliptic curve X? + Y? + Z? = 0. More 
information on this point may be obtained in Cassels’s survey article (1966), 
§§ 24 and 26. 

Repeating the procedure used in the proof of (10G) yields the following 
fact: 


(10H) If Fermat’s cubic has a nontrivial solution in the field Qim), then 
it has infinitely many (pairwise nonequivalent) solutions in this field. 


Į will not prove this, but illustrate with a typical numerical example. 
Beginning with the solution (2 + y —2,2 — y —2, 2) the proof of (10G) gives 


the solution 
(—6 + 5/6, —6 — 5,/6, 18). 
Repeating the procedure, I obtain the new solution 
(— 4374 + 1935,/ — 2, — 4374 — 1935,/ — 2, — 3078). 


It may be shown that the new solution is never equivalent to the given one. 

From these considerations, the main question is to decide whether in a 
given field Oe) there exists a nontrivial solution. As I said, this amounts 
to solving (10.4) in rational numbers m, u. This task cannot be handled 
directly as such. 

From Fueter’s result, I may assume that 3.4m. Four cases are possible, 
according to the sign of m, and m = +1 (mod3). For some unexplained 
reason, the results seem very much to differ according to the case. In some 
of the cases the class number of the quadratic field plays a role, in others 
it doesn’t. I shall denote by H(m) the class number of the quadratic field 
Q(/m), where m is square-free, m # 0. In 1913, Fueter proved: 


(101 If m is a square-free integer, m < 0, m = —1 (mod 3), and if there 
exists a nontrivial solution for Fermat’s cubic equation, then 3 divides the 
class number H(m). 


Fueter’s proof involved a long analysis of behavior of primes in appro- 
priate extensions. He gave also actual examples of solutions in various 
fields. For example, in Q(,/ — 31), which has class number 3, he computed 
the following solutions: 

9 —31\3 9-—./-31\3 
CY CEB wre 


2 
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and 
(36 + 5./ —31)° + (36 — 5y —31)° + 427 =0. 


Incidentally, applying the method of (10H) to the first of the above solu- 
tions does not lead to the second one. So, it is not true that the method 
indicated will provide all solutions from a given one. And one wonders how 
many “independent” solutions will suffice to describe all, after repeated 
application of (10H). This is still unknown. 

Fueter also considered the case where m < 0, m = 1 (mod 3) and he gave 
examples of solutions both when H(m) is a multiple, or not a multiple of 3: 

In Q(./—2), of class number 1, there are the solutions 


2+7 +2—./—277 +2 =0 
(16 + ./—2)? + (16 — AH + (— 20)? =0 


(once more the second solution is not derived from the first one by applying 


the method of (2H)). 
In Q(./ — 23), of class number 3, Fueter found the solution 


KC Sch + a= ==) + (—3)? =0. 


and 


In 1952, Aigner took up the case where m > 0. His first result makes 
essential use of an important theorem of Scholz (1932), which connects the 
3-rank of the class groups of Om) and Q(,/ — 3m). 

Let @ be the class group of Q(./m) (where m > 0) and let €’ be the class 
group of Q(./—3m). Let €; (respectively €) be the subgroup of elements 
of order dividing 3 in @ (respectively €). So @3, @3 are finite-dimensional 
vector spaces over the field F, with 3 elements. Let s = dim; (63), s = 
dim, ,(@3). Then Scholz proved: 


(10J) With the above notations 


s<s<ss+tl. 


In particular, if 3 does not divide H(— 3m), then 3 does not divide H(m). 

The above theorem is not an isolated fact, but a special case of Leopoldt’s 
reflection theorem (1958), explained in Lecture IX. 

Back to Aigner—he proved (1952): 


(10K) If m is a square-free integer, m>0 and m = 1 (mod 3), if there 
exists a nontrivial solution of Fermat’s cubic in Q(./m), then 3 divides H(— 3m). 


As an example of this situation, Aigner showed that (486 + 11,/58)? + 
(486 — 11/58)? — 630° = 0; in this case H(— 174) = 12. 

Next, he observed that if m > 0, m = —1 (mod 3), then the existence of a 
nontrivial solution is independent of the divisibility of the class number by 





10. Fugue (In the Quadratic Tone) 283 


3, as the following examples illustrate: 
In Q(./2), with H(-6) = 2: 


(18 + 17,/2)3 + (18 — 17/2} — 423 =0. 
In Q(,/82), with H(— 246) = 12: 
(2 + ./82)? + (2 — /82)3 - 10° =0. 


The situation being fairly mysterious, Aigner proceeded to investigate 
more deeply the problem in a series of very interesting papers (1952, 1955 
1956). His first result is the following: 


> 


(10L Let m be a square-free integer, 3m, m > 0. Assume the following 
condition: m is a product of primes q; such that q; = 1 (mod 3) and 2 is not a 
cubic residue modulo q;. Then Fermat’s cubic has only the trivial solution in 


Q(,/m) and in Q(./—m). 


For example, since 2 is not a cubic residue modulo 61 then Fermat’s 
cubic has only the trivial solution in Q(./ — 61); yet H(— 61) = 6. This shows 
that the converse of Fueter’s result (101) is false. 

Similarly, 2 is not a cubic residue modulo 67, so Fermat’s cubic has only 
the trivial solution in Q(/67); yet H{—- 201) = 12. This shows that the con- 
verse of Aigner’s theorem (10K) is false. 

Next Aigner improved his last result as follows: 


(10M) Let p be a prime such that p = 5 (mod 6) and € = 3/2 — 1 is nota 
square modulo p. Let m = 2pq, ` ' q,, with r > 0, where q; are distinct primes 
satisfying the conditions: q; = 1 (mod 3), 2 is not a cubic residue modulo q;. 
Then Fermat's cubic has only the trivial solution in Q(,/m) and in Q(,/ — m). 


These results are, however, only the most special cases of a theory de- 
veloped by Aigner in 1956. I cannot enter into any of the details, but I hope 
to explain clearly his results. 

The object is to find integers k which will guarantee that the only solution 
of Fermat’s cubic in certain fields will be trivial. Accordingly, an integer 
k>0 (not a multiple of 3) is an obstructing integer (=Unmöglichkeitskernzahl) 
for Fermat’s cubic when Fermat’s cubic has only the trivial solution in 
Q(/m) and in Q(y —m) for every m = kq, ::- q,, with r > 0, where q; are 
distinct primes satisfying the conditions: 


1. q; = 1 (mod 3) fori=1,...,7, 
2. 2 is not a cubic residue modulo gq.. 


Thus, the two preceding theorems say that 1 is an obstructing integer and 
if p is a prime, p = 5 (mod 6) and € = SCT — 1 is not a square modulo p, 
then 2p is an obstructing integer. 

To obtain results about obstructing integers, Aigner required consid- 
erations in the cubic field K = a2). This field has an integral basis 
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{1,2/2,2/4}. Every integer xe K is therefore uniquely written in the form 
a=a+ b3/2 + c2/4, with a, b, ce Z. This is abbreviated as oa = (a,b,c). 

The class number of K is 1. It has a fundamental unit e = 3/2 — 1. The 
norm of « = (a,b,c) is N(a) = a? + 2b? + 4c? — 6abc. Let & be defined by 
«& = N(x). Then & = (a? — 2bc, 2c? — ab, b’ — ac). 

An algebraic integer « = (a,b,c) is even when 2|a, that is, CH divides o. 
It is odd when 2 fa, that is, 3/2 Y a. If « is odd, then either « = ¢ (mod 3/4) 
or a = 1 (mod 2/4). This latter case happens if and only if 2 Fb and then o 
is called a primary odd integer. If x, $ are primary, so are af and g. 

The following rational integers play a role: dal = a? — 2bc, d,(a) = 
b? — ac, and d3(a) = 4c? — 2ab. 

A first result is the following: If « = (a,b,c) is a primary integer, then the 


Jacobi symbol 
d(x) NW 
Kal Gen 


Next, Aigner showed that if « = (a,b,c), a = (a’,b’,c’) are square-free rela- 
tively prime primary integers such that N(«), N(a’) are relatively prime 


square-free integers, then 
a\ [a 
DER 


This is a reciprocity law, which allowed Aigner to define a symbol T(p/q). 

Namely, first let p, q be distinc primes, p = 5 (mod6), q = 5 (mod 6). 
Then there exist primary numbers «, ai, unique up to square factors, such 
that o divides p, oa divides q. Since 


One) 


Aigner defined unambiguously 


vu 


So the symbol T(p/q) has value 1 or — 1, and 


0 


This symbol was extended by Aigner for all pairs of primes, irrespectively 
of their residue modulo 6, but DU not enter into this definition. 
Then Aigner proved (1956): 








(10N) If p, q are distinct primes, such that p = 5 (mod 6), q = 5 (mod 6) 
and 
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then, k = pq is an obstructing number for Fermat’s cubic. 


For example 1135=5 x 227 and 1927=41 x 47 are obstructing 
numbers, so Fermat’s cubic has only the trivial solution in Q(,/ +1135) 
and in Q(,/ + 1927), irrespective of the values of the class number. 

On the other hand, T$) = +1 and there is the solution in Q(./85): 


(1 + 785)’ + (1 — V85) — 8° =0. 


For several primes p;, p; = 5 (mod 6), when is k = p; ++- p, an obstructing 
number? Aigner found a sufficient condition in terms of a graph I associated 
with k. The vertices of I’ are points corresponding to the primes Di, P, 
and labelled by these primes. There is an edge between p,, p; exactly when 


EI ef 
Pj 


A graph T is called an obstructing graph when the following condition 
is satisfied: If V # Ø is any set of vertices of I’, there exists a vertex v, such 
that the total number of edges from vg to the vertices in V is odd. (I note that 
Vo may be chosen in V.) 

Examples of obstructing graphs are the following: 


LI | N 


Examples of nonobstructing graphs are: 


(the boxed vertices constitute 
the subset V of the definition) 





In this context, the main theorem is the following: 


(100) If the graph of k is an obstructing graph, then k is an obstructing 
integer for Fermat's cubic. 
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So the problem becomes one of determining conditions for a graph to 
be obstructing. Aigner found the following graph-theoretical theorem: 


(10P) If T is an obstructing graph, then the number of vertices must be even. 


As a corollary: 


(10Q) If k is the product of an odd number of distinct primes d where 
q; = 2 or q; = 5 (mod 6), then k is not an obstructing number. 


A more precise analysis allowed Aigner to show: 


(10R) If k = 2 (mod 3), then k is not an obstructing integer. 


The following examples of obstructing integers are among the innumerous 
possibilities which illustrate Aigner’s results: 


2 41 
64042 = 2 x 11 x 41 x 71 has the graph 
11 71 
5 17 
21505 = 5 x 11 x 17 x 23 has the graph 
11 23 
2 17 
3910 = 2 x 5 x 17x 23 has the graph 
5 23 


I stop here, refraining from giving still other results from Aigner. At any 
rate, this should be enough to incite the reader’s curiosity about this problem. 
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Epilogue 


There is no epilogue. The search continues. New methods are invented, 
which will in turn be applied to other problems. Or, it is just the reverse. 
And this is the best that could happen, for it is the probing and search of 
such profound questions that nourishes mathematics. 
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second case 4,35 
numbers 2,3, 154 
quotient 22, 63, 152 
Frobenius criterion 159 
Furtwangler’s theorems 169, 170, 182 


Galois group 80, 98, 168 
T-extensions 211 
Gauss 
quadratic reciprocity law 167 
sums 96 
Genus 213 
Gunderson’s estimate 23, 231 


Hilbert’s tenth problem 217 
Hecke’s theorem 182, 184 
Hensel’s lemma 251 
Hypothesis H 56 


Ideal class 80 
Ideal class group 20, 80, 119 
p-rank of the 21, 29 
Ideal numbers 7, 27, 77 
Ideals 7, 27, 78 
conjugate 80 
equivalent 80 
fractional 77, 80 
greatest common divisor of 77 
least common multiple of 79 
norm of 80 
maximal 79 
prime 79 
principal 8, 78 
Independent extensions 182 
Index 117, 127 
Infinite descent 4, 37, 39 
Inkeri’s 
estimate 20, 228 
theorem 172 
Integral basis 78 
Irregular couple 109, 110 
Irregular field 
improperly 28, 188 
properly 28, 188 
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Irregularity 

criterion 105 

index 21, 22, 29, 109 
Iwasawa’s theory 210 


Jacobi ’s 
cyclotomic function 11, 118, 127, 167 
reciprocity law 68 
resolvent 118, 119 
symbol 67, 167 
sums 248 
Jensen’s theorem 10 


Krasner’s theorem 149 
Kummer’s 
congruences 12, 23, 122, 124, 140, 177 
conjecture 132 
cyclotomic function 127 
extensions 167 
lemma on units 8, 86 
main (or monumental) theorem 8, 86 


A-adic logarithm 126 
Largest known prime 24, 154 
Legendre’s 

symbol 55, 167 

theorem 6 
Lehmer’s (Emma) summation 

congruences 155, 156, 158 

LeLidec 

polynomials 148 

congruences 148 
Leopoldt’s 

conjecture 192 

theorem 184, 282 
Lerch’s summation congruences 156 
Local-global principle 251 
Logarithmic differential quotients 177 
L-series 95 
Lucas test 24 


Maillet determinant 131 
McDonnell’s theorem 170 
Mersenne numbers 20, 24, 27, 154 
Mirimanoff’s 
congruences 145, 178 
polynomials 143 
theorem 152, 169 
Möbius function 69 
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Mordell’s conjecture 214 
Mordell— Weil’s theorem 207 
Morishima’s criterion 160 
Moriya’s theorems 172 


Nanumbers 273 
Non-standard model 217 


Obstructing 
graph 285 
integer 283 


p-adic 

L-functions 191 

logarithm 192 

methods 99 

regulator 192 
Paris prize 3, 13 
Peano’s arithmetic 216 
Periods 10, 115, 247 
Picard’s theorem 266 
p-independent elements 180 
p-integral 103 
Pollaczek’s criterion 159 
Power residue character or symbol 166 
Primary integer 181 
Prime 

E-regular 203 

irregular 10, 106, 107 

ramified 79, 168 

regular 8, 21, 86, 90, 106 

Sophie Germain 56 

split 168 

unramified 80, 168 
Pythagorean triple 36 


Reciprocity law 
Artin and Hasse’s 173 
Eisenstein’s 167, 173 
Gauss’ 167 
Jacobi’s 68, 173 
Regularity criterion 99, 103, 108 
Regulator 82 
Residue power character 127 
Residue ring 79 
Resolvent 
Jacobi’s 118, 119 
Lagrange’s 117, 118 
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Riemann’s 
hypothesis for function fields 13, 247, 250 
zeta function 10, 94, 103, 104 

Roth’s theorem 233 


Schinzel’s conjecture 20, 27, 154 
Secant function 202 
Semi-primary integer 83, 166 
Sentence 
existential 216 
undecidable 217 
universal 216 
Siegel’s theorem 234 
Singular integer 171 
Sophie Germain’s 
primes 56 
theorem 5, 6, 55 
Stirling’s formula 104 
Sums of two squares 36 
Symbol 
Aigner’s 284 
Jacobi’s 67, 167 
Legendre’s 55, 167 
power residue 166 
Sylow subgroup 81, 86 


Takagi’s formula 175, 176 
Theory 

complete 217 
consistent 217 

Theta functions 268 
Totient function 69 
Twin primes 134 


Units 
fundamental system of 81, 98 
group of 76 
Kummer’s lemma on 86 
Kummer (‘‘Kreiseinheiten’’) 82, 98, 128 


Vandiver’s 


congruence for the first factor of the class 
number 184 

Fermat quotient criterion 159 

summation congruence 157 

theorem 170 


Waring’s problem 246 
Weierstrass points 213 
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Wendt’s 
determinant 13, 61, 62 
theorem 13 


Wieferich’s theorem 139, 151, 169 
Wilson’s 

quotient 155 

theorem 155 
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Wolfskehl prize 13, 14 


Zeta function 
Dedekind’s 94, 96 
Riemann’s 10, 94, 103, 104 
Zeta series, Dedekind’s 98 





